Conjugates of vascular endothelial growth factor (VEGF) linked, either directly or via a linker, to a
targeted agent are provided. The targeted agent is a cytotoxic agent, such as a ribosome-inactivating
protein (RIP) or an antisense nucleic acid, or is a therapeutic nucleic acid for targeted delivery to
vascular endothelial cells. The targeted agent is attached to VEGF, directly or via a linker, through a
chemical bond, or the conjugate is prepared as a chimera using techniques of recombinant DNA. The
conjugates are used to target cytotoxic agents or therapeutic nucleotides to endothelial cells and are
particularly useful for treating solid tumors, such as Kaposi's sarcoma, and for treating ophthalmic
disorders with underlying vascular proliferation.

Description

CONJUGATES OF VASCULAR ENDOTHELIAL 
GROWTH FACTOR WITH TARGETED AGENTS 


Technical Field 

The present invention relates to the treatment of diseases, and more 
specifically, to the preparation of conjugates of a vascular endothelial cell growth factor 
and a targeted agent, and their use in altering the function, gene-expression, or viability 
of a cell in a therapeutic manner.

Background of the Invention 

A major goal of treatment of neoplastic diseases and hyperproliferative
disorders is to ablate the abnormally growing cells while leaving normal cells
untouched. Various methods are under development for providing treatment, but none
provides the requisite degree of specificity.

One method of treatment is to deliver toxins to appropriate targets.
Immunotoxins and cytotoxins are protein conjugates of toxin molecules with either
antibodies or factors which bind to receptors on target cells. Three major problems
may limit the usefulness of immunotoxins. First, the antibodies may react with more
than one cell surface molecule, thereby effecting delivery to multiple cell types,
possibly including normal cells. Second, even if the antibody is specific, the
antibody-reactive molecule may be present on normal cells. Third, the toxin molecule may be
toxic to cells prior to delivery and internalization. Cytotoxins suffer from similar
disadvantages of specificity and toxicity. Another limitation in the therapeutic use of
immunotoxins and cytotoxins is the relatively low ratio of therapeutic to toxic dosage.
Additionally, it may be difficult to direct sufficient concentrations of the toxin into the
cytoplasm and intracellular compartments in which the agent can exert its desired
activity.

Given these limitations, cytotoxic therapy has been attempted using viral
vectors to deliver DNA encoding the toxins into cells. If eukaryotic viruses are used,
such as the retroviruses currently in use, they may recombine with host DNA to
produce infectious virus. Moreover, because retroviral vectors are often inactivated by
the complement system, use in vivo is limited. Retroviral vectors also lack specificity
in delivery; receptors for most viral vectors are present on a large fraction of, if not all,
cells. Thus, infection with such a viral vector will infect normal as well as abnormal
cells. Because of this general infection mechanism, it is not desirable for a viral vector
to directly encode a cytotoxic molecule.

WO 96/06641

While delivery of nucleic acids offers advantages over delivery of
cytotoxic proteins, such as reduced toxicity prior to internalization, there is a need for
high specificity of delivery, which is currently unavailable with the present systems.

In view of the problems associated with gene therapy, there is a
compelling need for improved treatments which are more effective and are not
associated with such disadvantages. The present invention exploits the use of
conjugates which have increased specificity and deliver higher amounts of nucleic acids
to targeted cells, while providing other related advantages.



Summary of the Invention

The present invention generally provides conjugates of vascular
endothelial cell growth factor (VEGF) polypeptide or a portion thereof and a targeted
agent. In one embodiment of this invention, the VEGF and targeted agent are
conjugated through a linker. Within each conjugate, there can be more than one VEGF
and targeted agent molecule. Preferably, in the conjugates, there are between one and
six VEGF and targeted agents, and most preferably one VEGF molecule and one
targeted agent conjugated prior to dimerization. In certain embodiments, the linker is
selected from the group consisting of protease substrates, linkers that increase the
flexibility of the conjugate, linkers that increase the solubility of the conjugate,
photocleavable linkers and acid cleavable linkers. In certain other embodiments, the
VEGF polypeptide may be native human or bovine VEGF, or VEGF which is modified
by addition of a cysteine residue or replacement of a nonessential amino acid residue
within about 20 amino acids of the N-terminus or C-terminus. In yet other
embodiments, the targeted agent is cytotoxic, preferably a ribosome inactivating
protein, and most preferably saporin. Other cytotoxic agents include methotrexate,
anthracyclines, Pseudomonas exotoxin, porphyrin, or a nucleic acid.

In another embodiment, the conjugate has the formula:
targeted agent-(L)q-VEGF-(L)r-VEGF, wherein q and r, which may be the same or
different, are 0 or 1. In yet another embodiment, the conjugate has the formula:
targeted agent-(L)q-VEGF.
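
The two formulas above describe a small, finite set of arrangements, since q and r can each only be 0 or 1. The following Python sketch simply spells those arrangements out as strings. The function name `conjugate_formula` and the `dimeric` flag are illustrative assumptions introduced here, not terms from the specification, and the output is notation only; it says nothing about how any conjugate is made.

```python
from itertools import product


def conjugate_formula(q: int, r: int = 0, dimeric: bool = True) -> str:
    """Spell out targeted agent-(L)q-VEGF-(L)r-VEGF for q, r in {0, 1}.

    With dimeric=False the shorter formula targeted agent-(L)q-VEGF is
    produced and r is ignored. (Hypothetical helper for illustration.)
    """
    if q not in (0, 1) or r not in (0, 1):
        raise ValueError("q and r must each be 0 or 1")
    parts = ["targeted agent"]
    if q:
        parts.append("L")      # optional linker between agent and first VEGF
    parts.append("VEGF")
    if dimeric:
        if r:
            parts.append("L")  # optional linker between the two VEGF units
        parts.append("VEGF")
    return "-".join(parts)


# Enumerate every arrangement allowed by the two-VEGF formula.
variants = [conjugate_formula(q, r) for q, r in product((0, 1), repeat=2)]
for variant in variants:
    print(variant)
```

Running the loop lists four arrangements for the two-VEGF formula; calling `conjugate_formula(q, dimeric=False)` gives the two arrangements of the single-VEGF formula.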

In other aspects, methods of targeting an agent to cells bearing VEGF
receptors are provided, comprising conjugating the targeted agent to one or more VEGF
monomers or portions thereof that bind to a VEGF receptor, whereby the conjugated
targeted agent is internalized by the cells. In another aspect, methods of treating
VEGF-mediated pathophysiological conditions, comprising administering to an animal a
therapeutically effective amount of a conjugate between VEGF and a cytotoxic agent, are
provided. In certain embodiments, the condition is a dermatological disorder with underlying
vascular proliferation, a solid tumor, or an ophthalmic disorder, such as diabetic
retinopathy, proliferative vitreoretinopathy, and pterygium. The dermatological
disorder is Kaposi's sarcoma, psoriasis or macular degeneration. Methods are also
provided to inhibit proliferation of cells bearing VEGF receptors, comprising
contacting the cells with an effective amount of a VEGF targeted agent conjugate.

In yet other aspects, methods of effecting gene therapy are provided,
wherein cells are contacted with a conjugate having a targeted agent which is a nucleic
acid, and the conjugate includes a nuclear translocation sequence linked to the targeted
nucleic acid or VEGF.

In yet other aspects, DNA fragments encoding a conjugate between a
targeted agent and VEGF are provided. In certain embodiments, the DNA encoding the
conjugate may additionally encode a linker. Plasmids, vectors, and host cells are also provided.
In another embodiment, methods of producing a conjugate of VEGF and a targeted agent,
comprising growing a culture of cells transformed with a vector containing DNA encoding a
VEGF cytotoxic agent conjugate, whereby the DNA is transcribed and translated to produce the
conjugate, are provided.

In other embodiments, a VEGF monomer that is modified by insertion
of a cysteine residue within about 20 amino acids of the N-terminus or C-terminus,
wherein the inserted residue replaces a nonessential residue in the unmodified VEGF
monomer, is provided.

Pharmaceutical compositions comprising the VEGF targeted agent
conjugate and a physiologically acceptable excipient are also provided.

These and other aspects of the present invention will become evident
upon reference to the following detailed description and attached drawings. In addition,
various references are set forth below which describe in more detail certain procedures
or compositions, and are therefore incorporated by reference in their entirety.

Brief Description of the Drawings

Figure 1 is a Coomassie blue stained polyacrylamide-SDS gel and
Western blot analyses of VEGF production using the pPL-λ expression system.
Inclusion bodies were isolated from bacteria by the addition of lysozyme followed by
centrifugation. Equal amounts of each sample were run under reducing conditions. An
antibody to an N-terminus peptide of VEGF (Oncogene Sciences) was used in the
Western analysis. Lanes 1 and 5, VEGF165 t=0 hours post-induction; lanes 3 and 7,
VEGF121 t=0 hours post-induction; lanes 2 and 6, VEGF165 t=2 hours post-induction;
lanes 4 and 8, VEGF121 t=2 hours post-induction. Proteins of expected molecular
weights of 19.2 kD for monomeric VEGF165 and 14.2 kD for monomeric VEGF121
were observed.

Figure 2 is a Coomassie blue stained polyacrylamide-SDS gel analysis
of VEGF121 and VEGF165 under reducing and non-reducing conditions. Inclusion
bodies were isolated and VEGF refolded and dimerized under the given conditions.
Lanes 1 and 3, VEGF121; lanes 2 and 4, VEGF165. The predicted molecular weights
for VEGF121 are 14.2 kD and 28.4 kD for monomeric and dimeric forms respectively.
The predicted molecular weights for VEGF165 are 19.2 kD and 38.4 kD for monomeric
and dimeric forms respectively.
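
The predicted dimer weights quoted for Figure 2 are simply twice the monomer weights reported for Figure 1 (14.2 kD for VEGF121 and 19.2 kD for VEGF165). The short Python sketch below reproduces that arithmetic. The dictionary and function names are illustrative assumptions, and the calculation is a plain doubling of the predicted values; it is not a sequence-based molecular weight calculation.

```python
# Predicted monomer weights in kD, as quoted for Figures 1 and 2.
MONOMER_KD = {"VEGF121": 14.2, "VEGF165": 19.2}


def dimer_kd(monomer_kd: float) -> float:
    """Predicted weight of a homodimer: two monomers, rounded to 0.1 kD."""
    return round(2 * monomer_kd, 1)


# Hypothetical lookup table of predicted homodimer weights.
DIMER_KD = {name: dimer_kd(kd) for name, kd in MONOMER_KD.items()}

for name, kd in DIMER_KD.items():
    print(f"{name}: monomer {MONOMER_KD[name]} kD, dimer {kd} kD")
```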

Figure 3 is a graph of the results of an acid phosphatase assay, which
measures viable cell numbers, showing that purified VEGF121 or VEGF165 made in
E. coli can stimulate proliferation of HMVEC (human microvascular endothelial cells).
VEGF121 and VEGF165 were isolated from inclusion bodies and tested for their
ability to induce proliferation of HMVEC. E. coli derived material is shown in
comparison to either VEGF121 or VEGF165 produced in insect cells (R&D, BV).
Both forms of VEGF produced in E. coli induce proliferation of HMVEC in a dose
dependent manner at concentrations as low as 10^-[illegible] to 10^-[illegible] M. VEGF121
produced in E. coli is more potent than VEGF121 produced in insect cells, while VEGF165
made in E. coli is less active.

Figure 4 is a Coomassie blue stained polyacrylamide-SDS gel and
Western blot analysis of VEGF-SAP mitotoxin production using the pPL-λ expression
system. Inclusion bodies were isolated from bacteria by the addition of lysozyme
followed by centrifugation. Equal amounts of each sample were run under reducing
conditions. An antibody to an N-terminus peptide of saporin was used in the Western
analysis. Lanes 1 and 3, VEGF121-SAP; lanes 2 and 4, VEGF165-SAP. Proteins of
expected molecular weights of 42.2 kD for VEGF121-SAP and 47.2 kD for VEGF165-SAP
were observed.
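
As a consistency check on the expected weights quoted for Figures 1 and 4, subtracting each VEGF monomer weight from the corresponding fusion weight should leave the same remainder for the non-VEGF portion of the fusion. The Python sketch below does that subtraction. The names are illustrative assumptions, and the remainder is only what the quoted figures imply for saporin plus any linker; the text does not state it separately.

```python
# Expected weights in kD, as quoted for Figure 1 (monomers) and Figure 4 (fusions).
VEGF_MONOMER_KD = {"VEGF121": 14.2, "VEGF165": 19.2}
FUSION_KD = {"VEGF121": 42.2, "VEGF165": 47.2}


def non_vegf_portion_kd(isoform: str) -> float:
    """Fusion weight minus VEGF monomer weight, rounded to 0.1 kD."""
    return round(FUSION_KD[isoform] - VEGF_MONOMER_KD[isoform], 1)


# Hypothetical summary: implied weight of the SAP (plus linker) portion.
IMPLIED_KD = {isoform: non_vegf_portion_kd(isoform) for isoform in FUSION_KD}
print(IMPLIED_KD)
```

Both isoforms give the same remainder, so the two sets of figures agree with each other.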

Figure 5 is a Coomassie blue stained polyacrylamide-SDS gel of
VEGF121-SAP run under reducing and non-reducing conditions. Inclusion bodies
were isolated and VEGF121-SAP refolded under the given conditions. The predicted
molecular weight for VEGF121-SAP is 42.2 kD under reducing conditions.

Figure 6 is a graph depicting inhibition of protein synthesis in a cell-free
system. The effect of VEGF121-SAP on protein synthesis in a cell-free luciferase
system was compared to that of SAP.

Figure 7 is a graph showing that VEGF165-SAP inhibits proliferation of
HMVEC (human microvascular endothelial cells) in a dose dependent manner. CCSV,
chemical conjugate VEGF165-SAP; FPSV, SAP-VEGF121 made in E. coli from
inclusion bodies; VEGF121, insect cell derived.

Detailed Description of the Invention 

Definitions

Unless defined otherwise, all technical and scientific terms used herein
have the same meaning as is commonly understood by one of skill in the art to which
the subject matter herein belongs.

The "amino acids" are identified according to their well-known
three-letter or one-letter abbreviations. The nucleotides, which occur in the various
DNA fragments, are designated with the standard single-letter designations used
routinely in the art.

As used herein, to "bind to a receptor" refers to the ability of VEGF to
detectably bind to such receptors as assayed by standard in vitro assays. For example,
binding measures the capacity of a VEGF conjugate, VEGF monomer, or VEGF dimer
to recognize the VEGF receptor on vascular endothelial cells, such as an aortic vascular
endothelial cell line, using a procedure such as the one described in Moscatelli (J. Cell
Physiol. 131:123-130, 1987). Briefly, cells are grown to subconfluence and incubated
in appropriate buffer with radioiodinated VEGF dimer in the presence of various
concentrations of the VEGF monomer or dimer or VEGF conjugate of interest.
Binding affinity is measured by counting the membrane fraction that is solubilized in a
suitable buffer containing a detergent, such as 0.5% Triton X-100 in PBS (pH 8.1).

As used herein, "biological activity" refers to the in vivo activities of a
compound or physiological responses that result upon in vivo administration of a
compound, composition or other mixture. Biological activity, thus, encompasses
therapeutic effects and pharmaceutical activity of such compounds, compositions and
mixtures. Such biological activity may, however, be defined with reference to
particular in vitro activities, as measured in a defined assay. Thus, for example,
reference herein to the biological activity or reactivity of VEGF, a dimer thereof,
monomer, or fragment thereof, or other combination of VEGF monomers and
fragments, refers to the ability of the VEGF to bind to cells bearing VEGF receptors
and internalize a linked agent. Such activity is typically assessed in vitro by linking the
VEGF (dimer, monomer or fragment) to a cytotoxic agent, such as saporin, contacting
cells bearing VEGF receptors, such as aortic endothelial cells, with the conjugate and
assessing cell proliferation or growth. In vivo activity may be assessed using
recognized animal models, such as the mouse xenograft model for anti-tumor activity
(see, e.g., Beitz et al. (1992) Cancer Research 52:227-230; Houghton et al. (1982)
Cancer Res. 42:535-539; Bogden et al. (1981) Cancer (Philadelphia) 48:10-20;
Hoogenhout et al. (1983) Int. J. Radiat. Oncol. Biol. Phys. 9:871-879; Stastny et al.
(1993) Cancer Res. 53:5740-5744).

As used herein, a "conjugate" refers to a molecule that contains at least
one VEGF moiety and at least one targeted agent that are linked directly or via a linker
and that are produced by chemical coupling methods or by recombinant expression of
chimeric DNA molecules to produce fusion proteins.

As used herein, the term "cytotoxic agent" refers to a molecule capable
of inhibiting cell function. The agent may inhibit cell growth, differentiation or
proliferation or be toxic to cells. The term includes agents whose toxic effects are
mediated only when transported into the cell and also those whose toxic effect is
mediated at the cell surface. A variety of cytotoxic agents can be used and include
those that inhibit protein synthesis and those that inhibit expression of certain genes
essential for cell growth or survival.

As used herein, "DNA encoding a VEGF peptide or polypeptide" refers
to any of the DNA fragments set forth herein as coding such peptides, to any such DNA
fragments known to those of skill in the art, or to any DNA fragment that encodes a VEGF
that binds to a VEGF receptor and is internalized thereby. Such a DNA molecule may
be isolated from a human cell library using any DNA fragment that encodes any of the
VEGF peptides set forth in SEQ ID NOs. 25-28 or any DNA fragment that may be
produced from any of the preceding DNA fragments by substitution of degenerate
codons. It is understood that once the complete amino acid sequence of a peptide, such
as a VEGF peptide, is available to those of skill in this art, it is routine to substitute
degenerate codons and produce any of the possible DNA fragments that encode such
peptide. It is also generally possible to synthesize DNA encoding such peptide based
on the amino acid sequence.
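
To give a sense of how many DNA fragments "substitution of degenerate codons" can produce, the following Python sketch counts the distinct coding sequences for a peptide under the standard genetic code, in which each amino acid is specified by between one and six codons. The function name and the example peptide are illustrative assumptions; the example is not a VEGF sequence, and stop codons are not counted.

```python
from math import prod

# Number of sense codons per amino acid in the standard genetic code
# (one-letter abbreviations; the counts sum to 61).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_encoding_sequences(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the given peptide.

    The count is the product of the per-residue codon counts; a KeyError
    is raised for characters that are not one of the 20 standard residues.
    """
    return prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())


# Hypothetical three-residue example: Met (1 codon) x Lys (2) x Leu (6).
example_count = count_encoding_sequences("MKL")
print(example_count)
```

Because the count is a product, it grows very quickly with peptide length, which is why a definition covering all degenerate variants takes in a very large family of DNA fragments.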

As used herein, a "fusion protein" refers to a polypeptide that contains at
least two components, such as VEGF and a targeted agent or VEGF and linker, and is
produced by expression of DNA in a host cell.

As used herein, "nucleic acids" refer to RNA or DNA that are intended
as targeted agents, which include, but are not limited to, DNA encoding therapeutic
proteins, fragments of DNA for co-suppression, DNA encoding cytotoxic proteins,
antisense nucleic acids and other such molecules. Reference to nucleic acids includes
duplex DNA, single-stranded DNA, RNA in any form, including triplex, duplex or
single-stranded RNA, anti-sense RNA, polynucleotides, oligonucleotides, single
nucleotides and derivatives thereof.

Nucleic acids may be composed of the well-known
deoxyribonucleotides or ribonucleotides composed of the bases: adenosine, cytosine,
guanine, thymidine, and uridine. As well, various other nucleotide derivatives and
non-phosphate backbones or phosphate derivative backbones may be used.
For example, because normal phosphodiester oligonucleotides (referred
to as PO oligonucleotides or type I; see structure, below, where X = O) are sensitive to
DNA- and RNA-specific nucleases, several resistant types of oligonucleotides have
been developed (see, e.g., International Application WO 93/23570, which is based on
07/881,255, filed May 11, 1992; International Application WO 93/15742, which is
based on 07/833,146, filed February 10, 1992; Wagner et al. (1993) Science 260:1510-
1514; U.S. Patent No. 5,218,088; U.S. Patent No. 5,175,269; U.S. Patent No.
5,109,124; Carter et al. (1993) Br. J. Cancer 67:869-876); these include types II-IV:

[Structure: nucleotide backbone with base B and substituent X]

in which B is a nucleotide base; and X is OEt in phosphotriester (type II), X is Me in
methylphosphonate (type III; referred to as MP oligos); and X is S in
phosphorothioate (referred to as PS oligos; U.S. Patent No. 5,218,088 to Gorenstein
et al. describes a method for preparation of PS oligos). Presently, MP and PS
oligonucleotides have been the focus of most investigation.
As used herein, the term "VEGF" refers to
any polypeptide that, either as a monomer or dimer, binds to
a VEGF receptor and is transported into the cell by virtue of
its interaction with the receptor. A polypeptide that is "reactive" with the receptor
binds to the receptor and is internalized. VEGF refers to peptides having amino acid
sequences of native VEGF polypeptide monomers, as well as VEGF polypeptides
modified by amino acid substitutions, deletions, insertions or additions in the native
protein that, alone or linked to a targeted agent, retain the ability to bind to a VEGF
receptor and to be internalized in a cell bearing such receptor. Such polypeptides
include, but are not limited to, human VEGF121, human VEGF165, human VEGF189,
human VEGF206, bovine VEGF120, bovine VEGF164, bovine VEGF188, bovine
VEGF205, and homodimers and heterodimers of any VEGF monomer or monomers. In
addition, peptides reactive with VEGF receptors that are isolated by phage display
(U.S. Patent Nos. 5,223,409 and 5,403,484) are encompassed as well. It is understood
that differences in amino acid sequences can occur among VEGFs of different species
as well as among VEGFs from individual organisms or species and that such minor
allelic variations or variations among species are intended to be encompassed by
reference to VEGF herein. As used herein, a "portion of a VEGF" refers to a fragment
or piece of VEGF that is sufficient, either alone or as a dimer with another fragment or
a VEGF polypeptide, to bind to a VEGF receptor and internalize a linked targeted
agent.

Muteins of VEGF include, but are not limited to, those produced by
replacing one or more of the cysteines with serine as described herein or those that have
any other amino acids deleted or replaced. Typically, muteins will have conservative amino
acid changes, such as those set forth below in Table 1. DNA encoding such muteins will,
unless modified by replacement of degenerate codons, hybridize under conditions of at
least low stringency to DNA encoding a VEGF (SEQ ID NOs. 25-28) or an exon
thereof (SEQ ID NOs. 16-24). VEGF may be isolated from natural sources or be made
synthetically, such as by recombinant means or chemical synthesis.

As used herein, "VEGF-mediated pathophysiological condition" refers
to a deleterious condition characterized by or caused by proliferation of cells that are
sensitive to VEGF mitogenic stimulation.

As used herein, "VEGF receptors" refer to receptors that react with a
naturally-occurring member of the VEGF family of proteins and transport it into a cell
bearing such receptors. Included among these are the fms-like tyrosine kinase receptor
(FLT) and the kinase insert domain-containing receptor (KDR) (see, e.g., International
Application WO 92/14748, which is based on U.S. Application Serial No. 08/657,236;
de Vries et al. (1992) Science 255:989-91; Terman et al. (1992) Biochem. Biophys. Res.
Commun. 187:1579-1586; Kendall et al. (1993) Proc. Natl. Acad. Sci. USA 90:10705-
10709; and Peters et al. (1993) Proc. Natl. Acad. Sci. USA 90:8915-8919).

As used herein, a "targeted agent" is any agent that is intended for
internalization by linkage to VEGF, and that upon internalization alters or affects
cellular metabolism, growth, activity, viability or other property or characteristic of the
cell. The targeted agents include proteins, polypeptides, organic molecules, drugs,
nucleic acids and other such molecules. As used herein, to target a targeted agent, such
as a cytotoxic agent, means to direct it to a cell that expresses a selected receptor by
linking the agent to a polypeptide reactive with a VEGF receptor.

As used herein, a "therapeutic nucleic acid" describes any nucleic acid
used in the context of the invention that modifies gene transcription or translation. This term
also includes nucleic acids that bind to sites on proteins and to receptors. It includes,
but is not limited to, the following types of nucleic acids: nucleic acids encoding a
protein, antisense RNA, DNA intended to form triplex molecules, extracellular protein
binding oligonucleotides and small nucleotide molecules. A therapeutic nucleic acid
may serve as a replacement for a defective gene or encode a therapeutic product, such
as TNF or a cytotoxic molecule, such as saporin. The therapeutic nucleic acid may
encode all or a portion of a gene, and may function by recombining with DNA already
present in a cell, thereby replacing a defective portion of a gene. It may also encode a
portion of a protein and exert its effect by virtue of co-suppression of a gene product.

WO 96/06641 PCT/US95/10973

A. Vascular endothelial growth factors 

1. Polypeptides reactive with a VEGF receptor 

Vascular endothelial growth factors (VEGFs) were identified by their
ability to directly stimulate endothelial cell growth, but do not appear to have mitogenic
effects on other types of cells. VEGFs also cause a rapid and reversible increase in
blood vessel permeability. The members of this family have been referred to variously
as vascular endothelial growth factor (VEGF), vascular permeability factor (VPF) and
vasculotropin (see, e.g., Plouet et al., EMBO J. 8:3801-3806, 1989). Herein, they are
collectively referred to as VEGF.

VEGF was originally isolated from a guinea pig hepatocarcinoma cell
line, line 10 (see, e.g., U.S. Patent No. 4,456,550), and has subsequently been identified
in humans and in normal cells. It is expressed during normal development and in
certain normal adult organs. Purified VEGF is a basic, heparin-binding, homodimeric
glycoprotein that is heat-stable, acid-stable and may be inactivated by reducing agents.

DNA sequences encoding VEGF and methods to isolate these sequences
may be found primarily in U.S. Patent No. 5,240,848, U.S. Patent No. 5,332,671, U.S.
Patent No. 5,219,739, U.S. Patent No. 5,194,596, and Houck et al., Mol. Endocrin.
5:1806, 1991.

VEGF family members arise from a single gene organized as eight
exons and spanning approximately 14 kb in the human genome. Four molecular
species of VEGF result from alternative splicing of mRNA and contain 121, 165, 189
and 206 amino acids. The four species have similar biological activities, but differ
markedly in their secretion patterns. The predominant isoform secreted by a variety of
normal and transformed cells is VEGF165. Transcripts encoding VEGF121 and
VEGF189 are detectable in most cells and tissues that express the VEGF gene. In
contrast, VEGF206 is less abundant and has been identified only in a human fetal liver
cDNA library. VEGF121 is a weakly acidic polypeptide that lacks the heparin binding
domain and, consequently, does not bind to heparin. VEGF189 and VEGF206 are
more basic than VEGF165 and bind to heparin with greater affinity. Although not
every identified VEGF isoform binds heparin, all isoforms are considered to be
heparin-binding growth factors within the context of this invention.

The secreted isoforms, VEGF121 and VEGF165, are preferred VEGF
proteins. VEGF121 is particularly preferred. The longer isoforms, VEGF189 and
VEGF206, are almost completely bound to the extracellular matrix and need to be
released by an agent, such as urokinase, suramin, heparin or heparinase, and plasmin.
Other preferred VEGF proteins contain various combinations of VEGF exons, such that
the protein still binds VEGF receptor and is internalized. It is not necessary that a
VEGF protein used in the context of this invention either retain any of its in vivo
biological activities, such as stimulating endothelial cell growth, or bind heparin. It is
only necessary that the VEGF protein or fragment thereof bind the VEGF receptor and
be internalized into the cell bearing the receptor. However, it may be desirable in
certain contexts for VEGF to manifest certain of its biological activities. For example,
if VEGF is used as a carrier for DNA encoding a molecule useful in wound healing, it
would be desirable that VEGF exhibit vessel permeability activity and promotion of
fibroblast migration and angiogenesis. It will be apparent from the teachings provided
within the subject application which of the activities of VEGF are desirable to maintain.

VEGF promotes an array of responses in endothelium, including blood
vessel hyperpermeability, endothelial cell growth, angiogenesis, cell migration and
enhanced glucose transport. VEGF stimulates the growth of endothelial cells from a
variety of sources (including brain capillaries, fetal and adult aortas, and umbilical
veins) at low concentrations, but is reported to have no effect on the growth of vascular
smooth muscle cells, adrenal cortex cells, keratinocytes, lens epithelial cells, or BHK-21
fibroblasts. VEGF also is a potent polypeptide regulator of blood vessel function; it
causes a rapid but transient increase in microvascular permeability without causing
endothelial cell damage or mast cell degranulation, and its action is not blocked by
antihistamines. VEGF has also been reported to induce monocyte migration and
activation and has been implicated as a tumor angiogenesis factor in some human
gliomas. Also, VEGF is a chemoattractant for monocytes and VEGF has been shown
to enhance the activity of the inflammatory mediator tumor necrosis factor (TNF).

Quiescent and proliferating endothelial cells display high-affinity
binding to VEGF, and endothelial cell responses to VEGF appear to be mediated by
high affinity cell surface receptors. Two tyrosine kinases have been identified as VEGF
receptors. The first, known as fms-like tyrosine kinase or FLT, is a receptor tyrosine
kinase that is specific for VEGF. In adult and embryonic tissues, expression of FLT
mRNA is localized to the endothelium and to populations of cells that give rise to
endothelium. The second receptor, KDR (human kinase insert domain-containing
receptor), and its mouse homologue FLK-1, are closely related to FLT. The
KDR/FLK-1 receptor is expressed in endothelium during the fetal growth stage, during
earlier embryonic development, and in adult tissues. In addition, messenger RNAs
encoding FLT and KDR have been identified in tumor blood vessels and specifically in
endothelial cells of blood vessels supplying glioblastomas. Similarly, FLT and KDR
mRNAs are upregulated in tumor blood vessels in invasive human colon
adenocarcinoma, but not in the blood vessels of adjacent normal tissues.

VEGF suitable for use herein also includes any polypeptide or fragment
of a VEGF protein that retains the ability, either as a monomer or as part of a dimer, to
bind to a VEGF receptor and to be internalized by a cell bearing such receptor. In
addition, VEGF includes any combination of peptides encoded by the exons set forth in
SEQ ID NOs. 16-24 that retains the requisite receptor binding and internalization
activities. Amino acid sequence variations in VEGF, including allelic variations and
conservative amino acid substitutions, such as those set forth in TABLE 1, that do not
alter its ability to bind to VEGF receptors and to be internalized by cells upon such
binding are suitable for use in the present invention.

The various VEGF isoforms that result from alternative splicing of RNA
transcribed from a VEGF gene (see, e.g., U.S. Patent No. 5,219,739 to Tischer et al.;
U.S. Patent No. 5,194,596 to Tischer et al.; U.S. Patent No. 5,240,848 to Keck et al.;
International PCT Application No. WO 90/13649, which is based on U.S. applications
nos. 07/351,361, 07/369,424, 07/389,722, to Genentech, Inc., and U.S. applications
Serial Nos. 07/351,361, 07/369,424, 07/389,722; European Patent Applications
EP 0 506 477 A1 and EP 0 476 983 A1 to Merck & Co.; Houck et al. (1991) Mol.
Endo. 5:1806-1814; see also SEQ ID NOs. 18-28; see also SEQ ID NOs. 86-89 for
modified forms produced herein) are also suitable for use in the present invention.

Any polypeptide that is reactive with a VEGF receptor may be used in
the present invention. VEGF conjugates preferably include at least two VEGF
monomers in an antiparallel orientation. Dimer formation occurs when VEGF
monomers are mixed under physiological or other appropriate conditions. Also,
expression of tandem repeats of VEGF as fusion proteins, with or without linkers
separating the monomers, should result in dimers upon expression of DNA encoding
the VEGF fusion proteins.

VEGF may be isolated from any mammalian source, including human,
bovine and murine sources, although human is preferred. The VEGF polypeptides
include those that, when dimerized, are mitogenic to vascular endothelial cells.
Mitogenic activity, however, is not required for the VEGF moieties used herein. It is
sufficient that the polypeptide bind to a VEGF receptor and internalize a linked agent.

2. Modifications of VEGF

The preferred VEGF molecules are those that are set forth in SEQ ID 
Nos. 25-28 or peptides that have minor sequence variations of the peptides. Such 
minor sequence variations include, but are not limited to, minor allelic or species 
variations and insertions or deletions of residues, particularly cysteine residues. 
Suitable conservative substitutions of amino acids are known to those of skill in this art 
and may be made generally without altering the biological activity of the resulting 
molecule. Those of skill in this art recognize that, in general, single amino acid 
substitutions in non-essential regions of a polypeptide do not substantially alter 
biological activity (see, e.g., Watson et al. Molecular Biology of the Gene, 4th Edition,
1987, The Benjamin/Cummings Pub. Co., p.224). Such substitutions are preferably
made in accordance with those set forth in TABLE 1 as follows:



TABLE 1

Original Residue      Conservative substitution

Ala (A)               Gly; Ser
Arg (R)               Lys
Asn (N)               Gln; His
Cys (C)               Ser; neutral amino acid
Gln (Q)               Asn
Glu (E)               Asp
Gly (G)               Ala; Pro
His (H)               Asn; Gln
Ile (I)               Leu; Val
Leu (L)               Ile; Val
Lys (K)               Arg; Gln; Glu
Met (M)               Leu; Tyr; Ile
Phe (F)               Met; Leu; Tyr
Ser (S)               Thr
Thr (T)               Ser
Trp (W)               Tyr
Tyr (Y)               Trp; Phe
Val (V)               Ile; Leu

Other substitutions are also permissible and may be determined
empirically or in accord with known conservative substitutions. Any such modification
of the polypeptide may be effected by any means known to those of skill in this art.
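
For illustration only, TABLE 1 can be transcribed into a small lookup that reports
whether a proposed single-residue change is one of the listed conservative
substitutions. The names CONSERVATIVE_SUBSTITUTIONS and is_table1_substitution are
hypothetical and are not part of the specification. The entry "neutral amino acid"
given for Cys is not enumerated here, so the sketch treats only Ser as a listed
replacement for Cys.

```python
# TABLE 1 transcribed as a lookup, using three-letter residue codes.
# Assumption: "neutral amino acid" (listed for Cys) is left out because the
# table does not enumerate which residues it covers.
CONSERVATIVE_SUBSTITUTIONS = {
    "Ala": ["Gly", "Ser"],
    "Arg": ["Lys"],
    "Asn": ["Gln", "His"],
    "Cys": ["Ser"],
    "Gln": ["Asn"],
    "Glu": ["Asp"],
    "Gly": ["Ala", "Pro"],
    "His": ["Asn", "Gln"],
    "Ile": ["Leu", "Val"],
    "Leu": ["Ile", "Val"],
    "Lys": ["Arg", "Gln", "Glu"],
    "Met": ["Leu", "Tyr", "Ile"],
    "Phe": ["Met", "Leu", "Tyr"],
    "Ser": ["Thr"],
    "Thr": ["Ser"],
    "Trp": ["Tyr"],
    "Tyr": ["Trp", "Phe"],
    "Val": ["Ile", "Leu"],
}


def is_table1_substitution(original: str, replacement: str) -> bool:
    """Return True if the replacement is listed in TABLE 1 for the original residue."""
    return replacement in CONSERVATIVE_SUBSTITUTIONS.get(original, [])
```

A substitution that this lookup rejects is not thereby excluded, since other
substitutions may be determined empirically.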

VEGF peptides include those having SEQ ID NOs. 25-28, and versions
thereof that lack the leader sequence (amino acids 1-26 in any of SEQ ID NOs. 25-28),
including VEGF precursors that include all or a part of the signal sequence, or modified
forms of VEGF that retain the requisite activities (the ability to bind to a VEGF
receptor and internalize a linked targeted agent). Members of the VEGF protein family,
including human VEGF121, human VEGF165, human VEGF189 and human VEGF206,
are preferred. VEGF121 is particularly preferred. As provided herein, VEGF121 has
SEQ ID NO. 25 (see, also SEQ ID NOs. 86 and 88 for modified forms), and also is
formed from EXONS I-V and VIII (SEQ ID NOs. 16-20 and 24); VEGF165 has SEQ
ID NO. 26 (see, also SEQ ID NOs. 87 and 89 for modified forms), and also is formed
from EXONS I-V, VII and VIII (SEQ ID NOs. 16-20, 23 and 24); VEGF189 has SEQ
ID NO. 27, and also is formed from EXONS I-VII and VIII (SEQ ID NOs. 16-21, 23
and 24); and VEGF206 has SEQ ID NO. 28, and also is formed from EXONS I-VI, the
insert between EXONS VI and VII (see, SEQ ID NO. 22), and EXONS VII and VIII
(SEQ ID NOs. 16-24). It is noted that in the sequence of EXON V SEQ ID NO. 2, the
second Lys-encoding codon, AAG, has been reported as AAA. Consequently, in the
VEGF165, 189, and 206 forms, which contain this codon, the sequence, reported here
with the AAG codon, can also be AAA. Molecules, synthetic or naturally occurring,
that may be formed from combinations of SEQ ID NOs. 16-24 (or allelic or minor
conservatively substituted variations thereof) that possess the ability to bind to a VEGF
receptor and internalize a linked targeted agent are intended for use herein. If
necessary, such combinations of exons may be identified empirically by synthesizing
the molecule and testing it, using assays described herein or any other assays known to
those of skill in this art, for the ability, either as a monomer, or preferably as a dimer, to
bind to a VEGF receptor and internalize a linked targeted agent.
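
The exon composition recited above can be summarized, as a sketch only, in a mapping
from each isoform to the SEQ ID NOs. of its exons. The mapping follows the SEQ ID
numbers as stated in the text. The names ISOFORM_EXON_SEQ_IDS, assemble and
extra_exons are hypothetical, and the exon sequences themselves are assumed to be
supplied by the caller.

```python
# SEQ ID NOs. as recited in the text: exons I-VI are 16-21, the insert between
# exons VI and VII is 22, exon VII is 23 and exon VIII is 24.
ISOFORM_EXON_SEQ_IDS = {
    "VEGF121": [16, 17, 18, 19, 20, 24],
    "VEGF165": [16, 17, 18, 19, 20, 23, 24],
    "VEGF189": [16, 17, 18, 19, 20, 21, 23, 24],
    "VEGF206": [16, 17, 18, 19, 20, 21, 22, 23, 24],
}


def assemble(isoform: str, exon_sequences: dict) -> str:
    """Concatenate caller-supplied exon sequences in the order listed for the isoform."""
    return "".join(exon_sequences[seq_id] for seq_id in ISOFORM_EXON_SEQ_IDS[isoform])


def extra_exons(isoform: str, reference: str = "VEGF121") -> list:
    """List the SEQ ID NOs. present in the isoform but absent from the reference."""
    reference_ids = set(ISOFORM_EXON_SEQ_IDS[reference])
    return [seq_id for seq_id in ISOFORM_EXON_SEQ_IDS[isoform] if seq_id not in reference_ids]
```

For example, extra_exons("VEGF165") returns the single entry for exon VII, which is
the only region that distinguishes VEGF165 from VEGF121 in this listing.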

Mutation may be effected by any method known to those of skill in the
art, including site-specific or site-directed mutagenesis of DNA encoding the protein
and the use of DNA amplification methods using primers to introduce and amplify
alterations in the DNA template, such as PCR splicing by overlap extension (SOE).
Site-specific mutagenesis is typically effected using a phage vector that has single- and
double-stranded forms, such as M13 phage vectors, which are well-known and
commercially available. Other suitable vectors that contain a single-stranded phage
origin of replication may be used (see, e.g., Veira et al. (1987) Meth. Enzymol. 13:3).
In general, site-directed mutagenesis is performed by preparing a single-stranded vector
that encodes the protein of interest (i.e., a member of the VEGF family or a cytotoxic
molecule, such as a saporin). An oligonucleotide primer that contains the desired
mutation within a region of homology to the DNA in the single-stranded vector is
annealed to the vector, followed by addition of a DNA polymerase, such as E. coli
polymerase I Klenow fragment, which uses the double-stranded region as a primer to
produce a heteroduplex in which one strand encodes the altered sequence and the other
the original sequence. The heteroduplex is introduced into appropriate bacterial cells
and clones that include the desired mutation are selected. The resulting altered DNA
molecules may be expressed recombinantly in appropriate host cells to produce the
modified protein.

The SOE method uses two amplified oligonucleotide products, which
have complementary ends as primers and which include an altered codon at the locus at
which the mutation is desired, to produce a hybrid product. A second amplification
reaction that uses two primers that anneal at the non-overlapping ends amplifies the
hybrid to produce DNA that has the desired alteration.
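
The joining step of SOE can be sketched in a few lines, under simplifying
assumptions: both amplified products are written 5' to 3' on the same strand, and
the overlap is found by exact string matching rather than by annealing. The function
splice_by_overlap and the 24-nucleotide template are hypothetical and serve only to
show how two products that carry the same altered codon in their shared end combine
into one hybrid.

```python
def splice_by_overlap(left: str, right: str, min_overlap: int = 6) -> str:
    """Join two same-strand fragments through the longest end they share."""
    longest = min(len(left), len(right))
    for size in range(longest, min_overlap - 1, -1):
        # The 3' end of the left product must match the 5' end of the right one.
        if left[-size:] == right[:size]:
            return left + right[size:]
    raise ValueError("fragments share no overlap of the required length")


# Hypothetical template: codon 4 is Cys (TGC) and is to be changed to Ser (AGC).
template = "ATGGCTGAATGCCTGAAAGGTTAA"
left_product = "ATGGCTGAAAGCCTGAAA"    # first amplification, carries AGC
right_product = "GAAAGCCTGAAAGGTTAA"   # second amplification, carries AGC
hybrid = splice_by_overlap(left_product, right_product)
```

The hybrid has the same length as the template and differs from it only at the
altered codon.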

In certain embodiments, the heterogeneity of preparations may be
reduced by mutagenizing VEGF to replace reactive cysteines, leaving, preferably, only
one available cysteine for reaction. VEGF is modified by deleting or replacing a site(s)
that causes the heterogeneity. Such sites are typically cysteine residues that, upon
folding of the protein, remain available for interaction with other cysteines or for
interaction with more than one cytotoxic molecule per molecule of VEGF peptide.
Thus, such cysteine residues do not include any cysteine residues that are required for
proper folding of VEGF or for retention of the ability to bind to a VEGF receptor and
internalize. For chemical conjugation, one cysteine residue that, in physiological
conditions, is available for interaction, is not replaced because it is used as the site for
linking the cytotoxic moiety. The resulting modified VEGF is conjugated with a single
species of cytotoxic conjugate.

Alternatively, the contribution of each cysteine to the ability to bind to
a VEGF receptor may be determined empirically. Each cysteine residue may be
systematically replaced with a conservative amino acid change (see Table 1, above) or
deleted. The resulting mutein is tested for the requisite biological activity, the ability
to bind to growth factor receptors and internalize. If the mutein retains this activity,
then the cysteine residue is not required. Additional cysteines are systematically deleted
and replaced and the resulting muteins are tested for activity. Each of the remaining
cysteine residues may be systematically deleted and/or replaced by a serine residue or
other residue that would not be expected to alter the structure of the protein. The
resulting peptide is tested for biological activity. If the cysteine residue is necessary for
retention of biological activity it is not deleted; if it is not necessary, then it is preferably
replaced with a serine or other residue that should not alter the secondary structure of
the resulting protein. In this manner the minimum number and identity of the cysteines
needed to retain the ability to bind to a vascular endothelial growth factor receptor and
internalize may be determined. It is noted, however, that modified or mutant
heparin-binding growth factors may exhibit reduced or no proliferative activity, but may
be suitable for use herein, if they retain the ability to target a linked cytotoxic agent to
cells bearing receptors to which the unmodified heparin-binding growth factor binds and
result in internalization of the cytotoxic moiety. Monomeric forms of VEGF121
contain 9 cysteines and each of VEGF165, VEGF189 and VEGF206 contains 7
additional cysteine residues in the region not present in VEGF121. Any of the 7 are
likely to be non-essential for targeting and internalization of linked cytotoxic agents.
Recently, the role of Cys-25, Cys-56, Cys-67, Cys-101, and Cys-145 in dimerization
and biological activity was assessed (Claffey et al., Biochim. Biophys. Acta 1246:1-9,
1995). Dimerization requires Cys-25, Cys-56, and Cys-67. Substitution of any one of
these cysteine residues resulted in secretion of a monomeric VEGF, which was inactive
in both vascular permeability and endothelial cell mitotic assays. In contrast,
substitution of Cys-145 had no effect on dimerization, although biological activities
were somewhat reduced. Substitution of Cys-101 did not result in the production of a
secreted or cytoplasmic protein. Thus, substitution of Cys-145 is preferred.
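
The systematic replacement described above can be sketched as follows. This is a
minimal illustration, not the procedure of the Examples: cysteine_scan enumerates
single Cys-to-Ser muteins of a one-letter amino acid sequence, and
non_essential_cysteines keeps the positions whose muteins pass a caller-supplied
test. Both function names, the toy peptide, and the retains_activity callback (which
stands in for a binding and internalization assay) are hypothetical.

```python
def cysteine_scan(sequence: str, replacement: str = "S"):
    """Yield (position, mutein) pairs, replacing one Cys at a time (1-based positions)."""
    for index, residue in enumerate(sequence):
        if residue == "C":
            yield index + 1, sequence[:index] + replacement + sequence[index + 1:]


def non_essential_cysteines(sequence: str, retains_activity) -> list:
    """Return positions whose single-substitution mutein still passes the assay."""
    return [pos for pos, mutein in cysteine_scan(sequence) if retains_activity(mutein)]


# Hypothetical peptide with cysteines at positions 3, 7 and 10.
toy_peptide = "MKCAPLCEWCR"
muteins = dict(cysteine_scan(toy_peptide))
```

In practice the callback would be replaced by the experimental readout for each
mutein; the code only organizes which residue was changed in which construct.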

For chemical conjugates, the VEGF monomers are preferably linked via
non-essential cysteine residues to the linkers or to the targeted agent. VEGF that has
been modified by introduction of a Cys residue at or near one terminus, preferably the
N-terminus, is preferred for use in chemical conjugation (see Examples for preparation
of such modified VEGF). For use herein, preferably the VEGF is dimerized prior to
linkage to the linker and/or targeted agent. Methods for coupling proteins to the
linkers, such as the heterobifunctional agents, or to nucleic acids, or to proteins are
known to those of skill in the art and are also described herein.

It appears that all of the cysteines that are shared among the four VEGF
monomers, VEGF121, VEGF165, VEGF189 and VEGF206, are required. Other
cysteines that are present in VEGF165, VEGF189 and VEGF206 that are not present in
VEGF121 may be modified, and the resulting modified monomer tested for ability to
form dimers and for the requisite biological activities.

In particular, the VEGF molecules exemplified herein (SEQ ID NOs. 25-28)
have cysteines at positions 52, 77, 83, 86, 87, 94, 128, and 130 in all VEGF
monomers, and elsewhere in all monomers except for VEGF121. It appears that the
cysteines at residues 77, 86, 87, and 130 are required for intrachain binding and, thus,
should not be replaced. In order to decrease the potential for aggregate formation,
when monomers other than VEGF121 are used, it may be desirable to replace the
cysteine residues at positions other than 52, 77, 83, 86, 87, 94, 128, and 130; in
particular, those in the heparin-binding domain in VEGF165, VEGF189 and
VEGF206 may be replaced with a conservative substitution, such as with a serine
residue. Any replacements, however, should be checked for retention of the requisite
binding and internalization properties. Each cysteine residue may be systematically
replaced with a conservative amino acid change or deleted. The resulting mutein is
tested for the requisite biological activity, the ability to bind to VEGF receptors and
internalize linked targeted moieties. If the mutein retains this activity, then the cysteine
residue is not required. Additional cysteines are systematically deleted and replaced
and the resulting muteins are tested for activity. In this manner the minimum number
and identity of the cysteines needed to retain the ability to bind to a VEGF receptor and
internalize may be determined.

The VEGF polypeptide may also be modified by addition of one or more
cysteine residues at or near the C- or N-terminus, preferably the N-terminus, in order to
render it more amenable to chemical conjugation by providing a readily available
non-essential cysteine residue. VEGF has been modified herein by addition of Cys
residues at or near the N-terminus in order to render it more amenable to chemical
conjugation. Any VEGF may be modified for use herein by replacement of one or
more cysteine residues that are not required for binding to a VEGF receptor and
internalization of the targeted agent. These modified forms of VEGF are particularly
suitable for chemical conjugation to linkers and/or targeted agents.

VEGF polypeptides may be isolated by methods known to those of skill
in the art or may be prepared by expression of DNA encoding a VEGF protein (see,
e.g., Peretz et al. (1992) Biochem. Biophys. Res. Commun. 182:1340-1347; U.S. Patent
No. 4,456,550 to Dvorak et al.; U.S. Patent No. 5,219,739 to Tischer et al.; U.S. Patent
No. 5,194,596 to Tischer et al.; U.S. Patent No. 5,240,848 to Keck et al.; International
PCT Application No. WO 90/13649, which is based on U.S. applications Serial Nos.
07/351,361, 07/369,424 and 07/389,722, to Genentech, Inc.; European Patent
Applications EP 0 506 477 A1 and EP 0 476 983 A1 to Merck & Co.; Houck et al.
(1991) Mol. Endo. 5:1806-1814; see also SEQ ID NOs. 18-28 herein). It is understood
herein that the key property of any VEGF polypeptide or fragment thereof is the
ability, either as a monomer or as a dimer, to bind to VEGF receptors and to be
internalized by cells bearing such receptors.

B. Targeted agents

1. Cytotoxic agents 

Cytotoxic agent refers to a molecule capable of inhibiting cell function.
Cytotoxic agents include any agent that, upon internalization by a eukaryotic cell,
inhibits growth or proliferation of the cell, either by killing the cell or inhibiting a
metabolic pathway, transcription, or translation such that cell proliferation slows or
stops. Any agent that, when internalized, inhibits or destroys cell growth, cell
proliferation or other essential cell functions is suitable for use herein. Cytotoxic
agents include ribosome inactivating proteins, small metabolic inhibitors, antisense
nucleic acids, toxic drugs, such as anticancer agents, and small molecules, such as light
activated porphyrins. Ribosome inactivating proteins, such as saporin, are the
preferred cytotoxic protein agents for use herein and nucleic acids are the preferred
non-peptide agents.

Such cytotoxic agents include, but are not limited to, saporin, the ricins,
abrin and other RIPs, Pseudomonas exotoxin, inhibitors of DNA, RNA or protein
synthesis, including antisense nucleic acids and other metabolic inhibitors that are
known to those of skill in this art. Saporin is preferred, but other suitable RIPs include,
but are not limited to, ricin, ricin A chain, maize RIP, gelonin, diphtheria toxin and
diphtheria toxin A chain (see, e.g., U.S. Patent No. 4,675,382), trichosanthin, tritin,
pokeweed antiviral protein (PAP), mirabilis antiviral protein (MAP), Dianthins 32 and
30, abrin, momordin, bryodin, bryodin 2 (PCT application WO 95/11977), shiga,
cytotoxically active fragments of cytotoxins and others known to those of skill in this art
(see, e.g., Barbieri et al. (1982) Cancer Surveys 1:489-520 and European published
patent application No. 466,222, incorporated herein by reference, which provide lists of
numerous RIPs and their sources; see also, U.S. Patent No. 5,248,608).

The selected cytotoxic agent is, if necessary, derivatized to produce a 
group reactive with a cysteine on the selected VEGF. If derivatization results in a 
mixture of reactive species, a mono-derivatized form of the cytotoxic agent can be
isolated and then conjugated to the selected VEGF. 

a. Ribosome inactivating proteins 

Ribosome-inactivating proteins (RIPs), which include ricin, abrin and
saporin, are plant proteins that catalytically inactivate eukaryotic ribosomes. RIPs
inactivate ribosomes by interfering with the protein elongation step of protein synthesis.
For example, the RIP saporin (hereinafter also referred to as SAP) has been shown to
enzymatically inactivate the 60S ribosome by cleavage of the N-glycosidic bond of the
adenine at position 4324 in the rat 28S ribosomal RNA (rRNA). Some RIPs, such as
the toxins abrin and ricin, contain two constituent chains: a cell-binding chain that
mediates binding to cell surface receptors and internalization of the molecule; and an
enzymatically active chain responsible for protein synthesis inhibitory activity. Such
RIPs are type II RIPs. Other RIPs, such as the saporins, are single chains and are
designated type I RIPs. Because such RIPs lack a cell-binding chain, they are far less
toxic to whole cells than the RIPs that have two chains.

Several structurally related saporins have been isolated from seeds and
leaves of the plant Saponaria officinalis (soapwort). Among these, SAP-6 is the most
active and abundant, representing 7% of total seed proteins. Saporin is very stable, has
a high isoelectric point, does not contain carbohydrates, and is resistant to denaturing
agents, such as sodium dodecyl sulfate (SDS), and a variety of proteases. The amino
acid sequences of several saporin-6 isoforms from seeds are known and there appear to
be families of saporin RIPs differing in a few amino acid residues. Because saporin is a
type I RIP, it does not possess a cell-binding chain. Consequently, its toxicity to whole
cells is much lower than the other toxins, such as ricin and abrin. When internalized by
eukaryotic cells, however, its cytotoxicity is 100- to 1000-fold more potent than ricin A
chain.

Saporin is preferred herein. SO-4 and SO-6 are preferred saporin
molecules. SAP-6 (also called SO-6) is particularly preferred. The saporin
polypeptides include any of the isoforms of saporin that may be isolated from
Saponaria officinalis or related species or modified forms that retain cytotoxic activity.
Such modified forms have amino acid substitutions, deletions, insertions or additions
but still express substantial ribosome-inactivating activity. Purified preparations of
saporin are frequently observed to include several molecular isoforms of the protein. It
is understood that differences in amino acid sequences can occur in saporin from
different species as well as between saporin molecules from individual organisms of the
same species. In particular, such modified saporin may be produced by modifying the
DNA encoding the protein (see, e.g., published International PCT Application WO
93/25688 (Serial No. PCT/US93/05702), United States Application Serial No.
07/901,718; see, also, U.S. Patent Application No. 07/885,242, filed May 20, 1992, and
Patent No. 1231914, granted in Italy on January 15, 1992) by altering one or more
amino acids or deleting or inserting one or more amino acids, such as a cysteine that
may render it easier to conjugate to VEGF or other cell surface binding protein. Any
such protein, or portion thereof, that, when conjugated to VEGF as described herein,
exhibits cytotoxicity in standard in vitro or in vivo assays within at least about an order
of magnitude of the saporin conjugates described herein is contemplated for use herein.

Thus, the SAP used herein includes any protein that is isolated from
natural sources or that is produced by recombinant expression (see, e.g., copending
published International PCT Application WO 93/25688 (Serial No. PCT/US93/05702)
and United States Application Serial No. 07/901,718, filed June 16, 1992).

Some of the DNA molecules provided herein encode saporin that has
substantially the same amino acid sequence and ribosome-inactivating activity as that
of saporin-6 (SO-6), including any of four isoforms, which have heterogeneity at amino
acid positions 48 and 91 (see, e.g., Maras et al., Biochem. Internat. 21:631-638, 1990,
and Barra et al., Biotechnol. Appl. Biochem. 13:48-53, 1991; GB Patent 2,216,891 B
and EP Patent 89306106). Other suitable saporin polypeptides include other members
of the multi-gene family coding for isoforms of saporin-type ribosome-inactivating
proteins including SO-1 and SO-3 (Fordham-Skelton et al., Mol. Gen. Genet.
221:134-138, 1990), SO-2 (see, e.g., U.S. Application Serial No. 07/885,242, which
corresponds to GB 2,216,891; see, also, Fordham-Skelton et al., Mol. Gen. Genet.
229:460-466, 1991), SO-4 (see, e.g., GB 2,194,241 B; see, also, Lappi et al., Biochem.
Biophys. Res. Commun. 129:934-942, 1985) and SO-5 (see, e.g., GB 2,194,241 B; see,
also, Montecucchi et al., Int. J. Peptide Protein Res. 35:263-267, 1989).

b. Nucleic acids encoding other ribosome-inactivating proteins
and cytocides

In addition to saporin discussed above, other cytocides that inhibit
protein synthesis are useful in the present invention. The gene sequences for these
cytocides may be isolated by standard methods, such as PCR, probe hybridization of
genomic or cDNA libraries, antibody screenings of expression libraries, or clones
obtained from commercial or other sources. The DNA sequences of many of these
cytocides are well known, including ricin A chain (Genbank Accession No. X02388);
maize ribosome-inactivating protein (Genbank Accession No. L26305); gelonin
(Genbank Accession No. L12243; PCT Application WO 92/03155; U.S. Patent No.
5,376,546); diphtheria toxin (Genbank Accession No. K01722); trichosanthin (Genbank
Accession No. M34858); tritin (Genbank Accession No. D13795); pokeweed antiviral
protein (Genbank Accession No. X78628); mirabilis antiviral protein (Genbank
Accession No. D90347); dianthin 30 (Genbank Accession No. X59260); abrin
(Genbank Accession No. X55667); shiga (Genbank Accession No. M19437); and
Pseudomonas exotoxin (Genbank Accession Nos. K01397, M23348).

DNA encoding SAP or any cytotoxic agent may be used in the
recombinant methods provided herein. In instances in which the cytotoxic agent does
not contain a cysteine residue, such as instances in which DNA encoding SAP is
selected, the DNA may be modified to include a cysteine codon. The codon may be
inserted into any locus at which insertion does not reduce, or reduces by less than about
one order of magnitude, the cytotoxicity of the resulting protein. Such locus may be
determined empirically by modifying the protein and testing it for cytotoxicity in an
assay, such as a cell-free protein synthesis assay. The preferred locus in SAP for
insertion of the cysteine residue is at or near the N-terminus (within about 20 residues,
preferably 10 residues, of the N-terminus).
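
As a sketch of the modification described above, the following inserts a cysteine
codon into a coding sequence at a chosen residue boundary and rejects sites outside
the N-terminal window of about 20 residues. The function insert_cys_codon, the
choice of TGC (one of the two standard Cys codons), and the short toy coding
sequence are assumptions made for illustration; the sequence is not SAP.

```python
CYS_CODON = "TGC"  # assumption: TGC chosen; TGT also encodes cysteine


def insert_cys_codon(coding_dna: str, after_residue: int, max_distance: int = 20) -> str:
    """Insert a Cys codon after the given 1-based residue of an in-frame coding sequence."""
    if len(coding_dna) % 3 != 0:
        raise ValueError("coding sequence must be a whole number of codons")
    if not 0 <= after_residue <= max_distance:
        raise ValueError("insertion site is outside the N-terminal window")
    cut = 3 * after_residue
    if cut > len(coding_dna):
        raise ValueError("insertion site lies beyond the end of the sequence")
    return coding_dna[:cut] + CYS_CODON + coding_dna[cut:]


# Hypothetical coding sequence: Met-Val-Thr-Ser-Ile-Thr-stop.
toy_gene = "ATGGTTACCTCAATCACCTAA"
met_cys = insert_cys_codon(toy_gene, after_residue=1)
```

Placing the codon directly after the initiator Met is one choice consistent with the
stated preference for the N-terminus; whether a given locus preserves cytotoxicity
still has to be determined by assay.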

2. Expression of cytotoxic agents

Host organisms include those organisms in which recombinant
production of heterologous proteins has been carried out and in which the cytotoxic
agent, such as saporin, is not toxic or is of sufficiently low toxicity to permit expression
before cell death. Presently preferred host organisms are strains of bacteria. Most
preferred host organisms are strains of E. coli, particularly BL21(DE3) cells (Novagen,
Madison, WI).

The DNA encoding the cytotoxic agent, such as saporin protein, is
introduced into a plasmid in operative linkage to an appropriate promoter for expression
of polypeptides in a selected host organism. The presently preferred saporin proteins
are saporin proteins that have been modified by addition of a Cys residue or
replacement of a non-essential residue at or near the amino- or carboxyl terminus of the
saporin with Cys. Saporin, such as that of SEQ ID NO. 7, that has been modified by
insertion of Met-Cys residues at the N-terminus is preferred. It may additionally be
modified by replacement of the Asn or Ile residue at positions 4 and 10, respectively,
with cysteine (see Example 4). The DNA fragment encoding the saporin may also
include a protein secretion signal that functions in the selected host to direct the mature
polypeptide into the periplasm or culture medium. The resulting saporin protein can be
purified by methods routinely used in the art, including methods described hereinafter
in the Examples.

Methods of transforming suitable host cells, preferably bacterial cells,
and more preferably E. coli cells, as well as methods applicable for culturing said cells
containing a gene encoding a heterologous protein, are generally known in the art. See,
for example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, NY.

The DNA construct encoding the saporin protein is introduced into the
host cell by any suitable means, including, but not limited to, transformation employing
plasmids, bacterial phage vectors, transfection, electroporation, lipofection, and the
like. The heterologous DNA can optionally include sequences, such as origins of
replication, that allow for the extrachromosomal maintenance of the saporin-containing
plasmid, or can be designed to integrate into the genome of the host (as an alternative
means to ensure stable maintenance in the host).

Positive transformants can be characterized by Southern blot analysis
(Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, NY) for the site of DNA integration;
Northern blots for inducible-promoter-responsive saporin gene expression; and product
analysis for the presence of saporin-containing proteins in either the cytoplasm,
periplasm, or the growth media.

Once the saporin-encoding DNA fragment has been introduced into the
host cell, the desired saporin-containing protein is produced by subjecting the host cell
to conditions under which the promoter is induced, whereby the operatively linked
DNA is transcribed. In a preferred embodiment, such conditions are those that induce
expression from the E. coli lac operon. The plasmid containing the DNA encoding the
saporin-containing protein also includes the lac operator region within the promoter and
may also include the lac I gene encoding the lac repressor protein (lacIq) (see, e.g.,
Muller-Hill et al. (1968) Proc. Natl. Acad. Sci. USA 59:1259-1264). The lac repressor
represses the expression from the lac promoter until induced by the addition of IPTG in
an amount sufficient to induce transcription of the DNA encoding the
saporin-containing protein.

The expression of saporin in E. coli is thus accomplished in a two-stage
process. In the first stage, a culture of transformed E. coli cells is grown under
conditions in which the expression of the saporin-containing protein within the
transforming plasmid, preferably encoding a saporin, such as described in Example 4, is
repressed by virtue of the lac repressor. In this stage cell density increases. When an
optimum density is reached, the second stage commences by addition of IPTG, which
prevents binding of repressor to the operator, thereby inducing the lac promoter and
transcription of the saporin-encoding DNA.

In a preferred embodiment, the promoter is the T7 RNA polymerase
promoter, which may be linked to the lac operator, and the E. coli host strain includes
DNA encoding T7 RNA polymerase operably linked to the lac operator and a promoter,
preferably the lacUV5 promoter. A preferred plasmid is pET 11a (Novagen, Madison,
WI), which contains the T7lac promoter, T7 terminator, the inducible E. coli lac
operator, and the lac repressor gene. The plasmid pET 15b (Novagen, Madison, WI),
which contains a His-Tag™ leader sequence (SEQ ID NO. 36) for use in purification
with a His column and a thrombin cleavage site that permits cleavage following
purification over the column, the T7-lac promoter region and the T7 terminator, has
been used herein for expression of saporin. Addition of IPTG induces expression of the
T7 RNA polymerase and the T7 promoter, which is recognized by the T7 RNA
polymerase.

Transformed strains, which are of the desired phenotype and genotype,
are grown in fermentors by suitable methods well known in the art. In the first, or
growth, stage, expression hosts are cultured in defined minimal medium lacking the
inducing condition, preferably IPTG. When grown in such conditions, heterologous
gene expression is completely repressed, which allows the generation of cell mass in
the absence of heterologous protein expression. Subsequent to the period of growth
under repression of heterologous gene expression, the inducer, preferably IPTG, is
added to the fermentation broth, thereby inducing expression of any DNA operatively
linked to an IPTG-responsive promoter (a promoter region that contains lac operator).
This last stage is the induction stage.

The resulting saporin-containing protein can be suitably isolated from
the other fermentation products by methods routinely used in the art, e.g., using a
suitable affinity column as described in the Examples; precipitation with ammonium
sulfate; gel filtration; chromatography; preparative flat-bed iso-electric focusing; gel
electrophoresis; high performance liquid chromatography (HPLC); and the like. A
method for isolating saporin is provided in Example 1 (see, also Lappi et al. (1985)
Biochem. Biophys. Res. Commun. 129:934-942). The expressed saporin protein is
isolated from either the cytoplasm, periplasm, or the cell culture medium (see,
discussion below and Examples).

3. Porphyrins 

Porphyrins are well known light activatable toxins that can be readily
cross-linked to proteins (see, e.g., U.S. Patent No. 5,257,970; U.S. Patent No.
5,252,720; U.S. Patent No. 5,238,940; U.S. Patent No. 5,192,788; U.S. Patent No.
5,171,749; U.S. Patent No. 5,149,708; U.S. Patent No. 5,202,317; U.S. Patent No.
5,217,966; U.S. Patent No. 5,053,423; U.S. Patent No. 5,109,016; U.S. Patent No.
5,087,636; U.S. Patent No. 5,028,594; U.S. Patent No. 5,093,349; U.S. Patent No.
4,968,715; U.S. Patent No. 4,920,143 and International Application WO 93/02192).

Porphyrins are conjugated to proteins by direct covalent bonds using, for
example, a carbodiimide. Linkage may be effected by treatment of VEGF with
1-ethyl-3-(3-dimethylaminopropyl) carbodiimide in the presence of a reaction medium
such as DMSO. For other methods see U.S. Patent No. 4,968,715. The porphyrin-VEGF
conjugates may be administered topically or systemically. Activation of the
porphyrin is by irradiating light chosen to match the maximum absorbance of the
porphyrin-type photosensitizer.

4. Nucleic acids for targeted delivery

The conjugates provided herein are also designed to deliver nucleic acids
to targeted cells. The nucleic acids include those intended to deliver a cytotoxic signal
to a cell or to modify expression of genes and thereby effect genetic therapy. Examples
of nucleic acids include antisense RNA, DNA, ribozymes and oligonucleotides that
bind proteins. The nucleic acids can also include RNA trafficking signals, such as viral
packaging sequences (see, e.g., Sullenger et al. (1994) Science 262:1566-1569). The
nucleic acids also include DNA molecules that encode intact genes or that encode
proteins useful for gene therapy or for effecting cell cytotoxicity. Especially of interest
are DNA molecules that encode an enzyme that results in cell death or renders a cell
susceptible to cell death upon the addition of another product. For example, saporin is
an enzyme that cleaves rRNA and inhibits protein synthesis. Other enzymes that
inhibit protein synthesis are especially well suited for the present invention. Other
enzymes may be used where the enzyme activates a compound with little or no
cytotoxicity into a toxic product.

DNA (or RNA) that may be delivered to a cell to effect genetic therapy
includes DNA that encodes tumor-specific cytotoxic molecules, such as tumor necrosis
factor, viral antigens and other proteins to render a cell susceptible to anti-cancer
agents, and DNA encoding genes, such as the defective gene (CFTR) associated with
cystic fibrosis (see, e.g., International Application WO 93/03709, which is based on
U.S. Application Serial No. 07/745,900; and Riordan et al. (1989) Science
245:1066-1073), to replace defective genes.

Nucleic acids and oligonucleotides for use as described herein can be
synthesized by any method known to those of skill in this art (see, e.g., WO 93/01286,
which is based on U.S. Application Serial No. 07/723,454; U.S. Patent No. 5,218,088;
U.S. Patent No. 5,175,269; U.S. Patent No. 5,109,124). Identification of
oligonucleotides and ribozymes for use as antisense agents, as well as selection of DNA
encoding genes for targeted delivery for genetic therapy, is well within the skill in
this art. For example, the desirable properties, lengths and other characteristics of such
oligonucleotides are well known. Antisense oligonucleotides are designed to resist
degradation by endogenous nucleolytic enzymes and include, but are not limited to:
phosphorothioate, methylphosphonate, sulfone, sulfate, ketyl, phosphorodithioate,
phosphoramidate, phosphate esters, and other such linkages (see, e.g., Agrawal et al.,
Tetrahedron Lett. 28:3539-3542 (1987); Miller et al., J. Am. Chem. Soc. 93:6657-6665
(1971); Stec et al., Tetrahedron Lett. 26:2191-2194 (1985); Moody et al., Nucl. Acids
Res. 12:4769-4782 (1989); Uznanski et al., Nucl. Acids Res. (1989); Letsinger et al.,
Tetrahedron 40:137-143 (1984); Eckstein, Annu. Rev. Biochem. 54:367-402 (1985);
Eckstein, Trends Biol. Sci. 14:97-100 (1989); Stein In: Oligodeoxynucleotides.
Antisense Inhibitors of Gene Expression, Cohen, Ed., Macmillan Press, London, pp.
97-117 (1989); Jager et al., Biochemistry 27:7237-7246 (1988)).

a. Antisense nucleotides 

Antisense nucleotides are oligonucleotides that bind in a sequence-specific
manner to nucleic acids, such as mRNA or DNA. When bound to mRNA that
has complementary sequences, antisense prevents translation of the mRNA (see, e.g.,
U.S. Patent No. 5,168,053 to Altman et al.; U.S. Patent No. 5,190,931 to Inouye; U.S.
Patent No. 5,135,917 to Burch; U.S. Patent No. 5,087,617 to Smith; and Clusel et al.
(1993) Nucl. Acids Res. 21:3405-3411, which describes dumbbell antisense
oligonucleotides). Triplex molecules refer to single DNA strands that bind duplex
DNA forming a colinear triplex molecule and thereby prevent transcription (see, e.g.,
U.S. Patent No. 5,176,996 to Hogan et al., which describes methods for making
synthetic oligonucleotides that bind to target sites on duplex DNA).
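
The sequence complementarity that underlies antisense binding can be illustrated with a short Python sketch. It is offered only as an illustration and is not part of the disclosed methods: the function name and the example target sequence are hypothetical, and backbone modifications such as phosphorothioate linkages are not modeled.

```python
# Hypothetical helper: derive an antisense DNA oligonucleotide for an mRNA target.
# Each mRNA base is paired with its Watson-Crick DNA partner.
RNA_TO_ANTISENSE_DNA = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_oligo(mrna_target):
    """Return the antisense DNA strand (written 5'->3') for an mRNA written 5'->3'."""
    target = mrna_target.upper()
    unknown = set(target) - set(RNA_TO_ANTISENSE_DNA)
    if unknown:
        raise ValueError("unexpected bases in mRNA target: %s" % sorted(unknown))
    # Complement each base and reverse, so that the result also reads 5'->3'.
    return "".join(RNA_TO_ANTISENSE_DNA[base] for base in reversed(target))


if __name__ == "__main__":
    # Made-up target fragment, used only to show the call.
    print(antisense_oligo("AUGGCUUCA"))
```

For the made-up fragment AUGGCUUCA the sketch returns TGAAGCCAT, the reverse complement written as DNA.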

Particularly useful antisense nucleotides and triplex molecules are
molecules that are complementary or bind to the sense strand of DNA or mRNA that
encodes an oncogene, such as bFGF, int-2, hst-1/K-FGF, FGF-5, hst-2/FGF-6 and FGF-8.
Other useful antisense oligonucleotides include those that are specific for IL-8 (see,
e.g., U.S. Patent No. 5,241,049; and International applications WO 89/004836; WO
90/06321; WO 89/10962; WO 90/00563; and WO 91/08483, and the corresponding
U.S. applications for descriptions of DNA encoding IL-8 and amino acid sequences of
IL-8), which can be linked to bFGF for the treatment of psoriasis, and antisense
oligonucleotides that are specific for nonmuscle myosin heavy chain and/or c-myb (see,
e.g., Simons et al. (1992) Circ. Res. 70:835-843; WO 93/01286, which is based on U.S.
application Serial No. 07/723,454; LeClerc et al. (1991) J. Am. Coll. Cardiol. 17
(2 Suppl. A):105A; Ebbecke et al. (1992) Basic Res. Cardiol. 87:585-591), which can
be targeted by an FGF to inhibit smooth muscle cell proliferation, such as that
following angioplasty, and thereby prevent restenosis or inhibit viral gene expression in
transformed or infected cells.

b. Ribozymes

A ribozyme is an RNA molecule that specifically cleaves RNA
substrates, such as mRNA, and thus inhibits or interferes with cell growth or expression.
There are at least five known classes of ribozymes that are involved in the
cleavage and/or ligation of RNA chains. Ribozymes can be targeted to any RNA
transcript and can catalytically cleave such transcript (see, e.g., U.S. Patent No.
5,272,262; U.S. Patent No. 5,144,019; and U.S. Patent Nos. 5,168,053, 5,180,818,
5,116,742 and 5,093,246 to Cech et al., which describe ribozymes and methods for
production thereof). Any such ribozyme may be linked to the growth factor for
delivery to VEGF-receptor bearing cells.

The ribozymes may be delivered to the targeted cells, such as tumor cells
that express a receptor to which VEGF binds and upon binding is internalized, as DNA
encoding the ribozyme linked to a eukaryotic promoter, such as a eukaryotic viral
promoter, generally a late promoter, such that upon introduction into the nucleus, the
ribozyme will be directly transcribed. In such instances, the construct will also include
a nuclear translocation sequence (NTS; see Table 2, below), generally as part of the
growth factor or as part of a linker between the growth factor and linked DNA.

c. Nucleic acids encoding therapeutic products

Among the DNA that encodes therapeutic products contemplated for use
is DNA encoding correct copies of defective genes, such as the defective gene (CFTR)
associated with cystic fibrosis (see, e.g., International Application WO 93/03709, which
is based on U.S. Application Serial No. 07/745,900; and Riordan et al. (1989) Science
245:1066-1073), and DNA encoding anticancer agents, such as tumor necrosis factors,
and cytotoxic agents, such as saporin, for delivery to VEGF-receptor bearing cells. The
conjugate should include an NTS. If the conjugate is designed such that the VEGF and
linked DNA are cleaved in the cytoplasm, then the NTS should be included in a portion
of the linker that remains bound to the DNA, so that, upon internalization, the conjugate
will be trafficked to the nucleus. The nuclear translocation sequence (NTS) may be a
heterologous sequence or may be derived from the selected growth factor.

d. Other nucleic acids 

Extracellular protein binding oligonucleotides refer to oligonucleotides
that specifically bind to proteins. Small nucleotide molecules refer to nucleic acids that
target a receptor site.

e. Coupling of nucleic acids to proteins 

To effect chemical conjugation herein, the VEGF protein is linked to the
nucleic acid either directly or via one or more linkers. Methods for conjugating nucleic
acids, at the 5' ends, 3' ends and elsewhere, to the amino and carboxyl termini and other
sites in proteins are known to those of skill in the art (for a review see, e.g., Goodchild
(1993) In: Perspectives in Bioconjugate Chemistry, Mears, Ed., American Chemical
Society, Washington, D.C., pp. 77-99). For example, proteins have been linked to
nucleic acids using ultraviolet irradiation (Sperling et al. (1978) Nucleic Acids Res.
5:2755-2773; Fiser et al. (1975) FEBS Lett. 52:281-283), bifunctional chemicals
(Baumert et al. (1978) Eur. J. Biochem. 89:353-359; and Oste et al. (1979) Mol. Gen.
Genet. 168:81-86) and photochemical cross-linking (Vanin et al. (1981) FEBS Lett.
124:89-92; Rinke et al. (1980) J. Mol. Biol. 137:301-314; Millon et al. (1980) Eur. J.
Biochem. 110:485-454).

In particular, the reagents N-acetyl-N'-(p-glyoxylylbenzolyl)cystamine
and 2-iminothiolane have been used to couple DNA to proteins, such as
α2-macroglobulin (α2M), via mixed disulfide formation (see Cheng et al. (1983) Nucleic
Acids Res. 11:659-669). N-acetyl-N'-(p-glyoxylylbenzolyl)cystamine reacts
specifically with nonpaired guanine residues and, upon reduction, generates a free
sulfhydryl group. 2-Iminothiolane reacts with proteins to generate sulfhydryl groups
that are then conjugated to the derivatized DNA by an intermolecular disulfide
interchange reaction. Any linkage may be used provided that, upon internalization of
the conjugate, the targeted nucleic acid is active. Thus, it is expected that cleavage of
the linkage may be necessary, although it is contemplated that for some reagents, such
as DNA encoding ribozymes linked to promoters or DNA encoding therapeutic agents
for delivery to the nucleus, such cleavage may not be necessary.

Thiol linkages can be readily formed using heterobifunctional reagents.
Amines have also been attached to the terminal 5' phosphate of unprotected
oligonucleotides or nucleic acids in aqueous solutions by reacting the nucleic acid with
a water-soluble carbodiimide, such as 1-ethyl-3-[3-dimethylaminopropyl]carbodiimide
(EDC) or N-ethyl-N'-(3-dimethylaminopropyl)carbodiimide hydrochloride (EDCI), in
imidazole buffer at pH 6 to produce the 5'-phosphorimidazolide. Contacting the
5'-phosphorimidazolide with amine-containing molecules, such as a VEGF, and
ethylenediamine, results in stable phosphoramidates (see, e.g., Chu et al. (1983) Nucleic
Acids Res. 11:6513-6529; and WO 88/05077). In particular, a solution of DNA is
saturated with EDC at pH 6 and incubated with agitation at 4°C overnight. The
resulting solution is then buffered to pH 8.5 by adding, for example, about 3 volumes of
100 mM citrate buffer, followed by adding about 5 µg to about 20 µg of a VEGF and
agitating the resulting mixture at 4°C for about 48 hours. The unreacted protein may be
removed from the mixture by column chromatography using, for example, Sephadex G75
(Pharmacia) with 0.1 M ammonium carbonate solution, pH 7.0, as an eluting buffer.
The isolated conjugate may be lyophilized and stored until used.

U.S. Patent No. 5,237,016 provides methods for preparing nucleotides
that are bromoacetylated at their 5' termini and reacting the resulting oligonucleotides
with thiol groups. Oligonucleotides derivatized at their 5'-termini with bromoacetyl
groups can be prepared by reacting 5'-aminohexyl-phosphoramidate oligonucleotides with
bromoacetic acid-N-hydroxysuccinimide ester as described in U.S. Patent No.
5,237,016. U.S. Patent No. 5,237,016 also describes methods for preparing
thiol-derivatized nucleotides, which can then be reacted with thiol groups on the selected
growth factor. Briefly, thiol-derivatized nucleotides are prepared from a
5'-phosphorylated nucleotide in two steps: (1) reaction of the phosphate group with
imidazole in the presence of a diimide and displacement of the imidazole leaving group
with cystamine in one reaction step; and (2) reduction of the disulfide bond of the
cystamine linker with dithiothreitol (see, also, Orgel et al. (1986) Nucl. Acids Res.
14:651, which describes a similar procedure). The 5'-phosphorylated starting
oligonucleotides can be prepared by methods known to those of skill in the art (see, e.g.,
Maniatis et al. (1982) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory, New York, p. 122).

The antisense oligomer or nucleic acid, such as a methylphosphonate
oligonucleotide (MP-oligomer), may be derivatized by reaction with SPDP or SMPB.
The resulting MP-oligomer may be purified by HPLC and then coupled to VEGF,
which may be modified by replacement of one or more non-essential cysteine residues, as
described above. The MP-oligomer (about 0.1 µM) is dissolved in about 40-50 µL of
1:1 acetonitrile/water to which phosphate buffer (pH 7.5, final concentration 0.1 M) and
a 1 mg MP-oligomer in about 1 ml phosphate buffered saline is added. The reaction is
allowed to proceed for about 5-10 hours at room temperature and is then quenched with
about 15 µL 0.1 M iodoacetamide. The VEGF-oligonucleotide conjugates can be purified
on heparin sepharose Hi-Trap columns (1 ml, Pharmacia) and eluted with a linear or
step gradient. The conjugate should elute in 0.6 M NaCl.

f. Nucleic acids encoding cytocides 

A cytocide-encoding agent is a nucleic acid molecule (DNA or RNA)
that, upon internalization by a cell and subsequent transcription and/or translation into
a cytocidal agent, is cytotoxic to a cell or inhibits cell growth by inhibiting protein
synthesis.

Cytocides include saporin, the ricins, abrin and other ribosome-inactivating
proteins, Pseudomonas exotoxin, diphtheria toxin, angiogenin, tritin,
dianthins 32 and 30, momordin, pokeweed antiviral protein, mirabilis antiviral protein,
bryodin, and shiga exotoxin, as well as other cytocides that are known to
those of skill in the art.

Especially of interest are DNA molecules that encode an enzyme that
results in cell death or renders a cell susceptible to cell death upon the addition of
another product. For example, saporin, a preferred cytocide, is an enzyme that cleaves
rRNA and inhibits protein synthesis. Other enzymes that inhibit protein synthesis are
especially well suited for use in the present invention. In addition, enzymes may be
used where the enzyme activates a compound with little or no cytotoxicity into a toxic
product that inhibits protein synthesis.

In addition to saporin discussed above, other cytocides that inhibit
protein synthesis are useful in the present invention. The gene sequences for these
cytocides may be isolated by standard methods, such as PCR, probe hybridization of
genomic or cDNA libraries, or antibody screening of expression libraries, or clones
may be obtained from commercial or other sources. The DNA sequences of many of these
cytocides are well known, including ricin A chain (Genbank Accession No. X02388);
maize ribosome-inactivating protein (Genbank Accession No. L26305); gelonin
(Genbank Accession No. L12243; PCT Application WO 92/03155; U.S. Patent No.
5,376,546); diphtheria toxin (Genbank Accession No. K01722); trichosanthin (Genbank
Accession No. M34858); tritin (Genbank Accession No. D13795); pokeweed antiviral
protein (Genbank Accession No. X78628); mirabilis antiviral protein (Genbank
Accession No. D90347); dianthin 30 (Genbank Accession No. X59260); abrin
(Genbank Accession No. X55667); shiga (Genbank Accession No. M19437) and
Pseudomonas exotoxin (Genbank Accession Nos. K01397, M23348).

In the case of cytocide molecules such as the ribosome-inactivating
proteins, very few molecules may need to be present for cell killing. Indeed, only a single
molecule of diphtheria toxin introduced into a cell was sufficient to kill the cell. In
other cases, it may be that propagation or stable maintenance of the construct is
necessary to attain sufficient numbers or concentrations of the gene product for
effective gene therapy. Examples of replicating and stable eukaryotic plasmids are
found in the scientific literature.

In general, constructs will also contain elements necessary for
transcription and translation. If the cytocide-encoding agent is DNA, then it must
contain a promoter. The choice of the promoter will depend upon the cell type to be
transformed and the degree or type of control desired. Promoters can be constitutive or
active in any cell type, tissue specific, cell specific, event specific or inducible.
Cell-type specific promoters and event type specific promoters are preferred. Examples of
constitutive or nonspecific promoters include the SV40 early promoter (U.S. Patent No.
5,118,627), the SV40 late promoter (U.S. Patent No. 5,118,627), CMV early gene
promoter (U.S. Patent No. 5,168,062), and adenovirus promoter. In addition to viral
promoters, cellular promoters are also amenable within the context of this invention. In
particular, cellular promoters for the so-called housekeeping genes are useful. Viral
promoters are preferred, because generally they are stronger promoters than cellular
promoters.

Tissue specific promoters are particularly useful when a particular tissue
type is to be targeted for transformation. By using one of this class of promoters, an
extra margin of specificity can be attained. For example, when the indication to be
treated is ophthalmological, either the alpha-crystallin promoter or gamma-crystallin
promoter is preferred. When a tumor is the target of gene delivery, cellular promoters
for specific tumor markers or promoters more active in tumor cells should be chosen.
Thus, to transform prostate tumor cells the prostate-specific antigen promoter is
especially useful. Similarly, the tyrosinase promoter or tyrosinase-related protein
promoter is a preferred promoter for melanoma treatment. For B lymphocytes, the
immunoglobulin variable region gene promoter; for T lymphocytes, the TCR receptor
variable region promoter; for helper T lymphocytes, the CD4 promoter; and for liver, the
albumin promoter are but a few examples of tissue specific promoters. Many other
examples of tissue specific promoters are readily available to one skilled in the art.
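
For bookkeeping purposes, the tissue-specific promoters named in the preceding paragraph can be collected in a simple lookup table. The following Python sketch is illustrative only: the target labels used as keys and the function name are hypothetical, and the table contains nothing beyond the examples given above.

```python
# Illustrative lookup built only from the promoters named in the text above.
# The keys (target labels) and the function name are hypothetical.
TISSUE_SPECIFIC_PROMOTERS = {
    "eye": ["alpha-crystallin promoter", "gamma-crystallin promoter"],
    "prostate tumor": ["prostate-specific antigen promoter"],
    "melanoma": ["tyrosinase promoter", "tyrosinase-related protein promoter"],
    "B lymphocyte": ["immunoglobulin variable region gene promoter"],
    "T lymphocyte": ["TCR variable region promoter"],
    "helper T lymphocyte": ["CD4 promoter"],
    "liver": ["albumin promoter"],
}


def candidate_promoters(target):
    """Return the promoters listed for a target, or an empty list if none is listed."""
    return list(TISSUE_SPECIFIC_PROMOTERS.get(target, []))


if __name__ == "__main__":
    print(candidate_promoters("melanoma"))
```

A target that is absent from the table simply yields an empty list, which reflects that the examples above are not exhaustive.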

Inducible promoters may also be used. These promoters include the
MMTV LTR (PCT WO 91/13160), which is inducible by dexamethasone;
metallothionein, which is inducible by heavy metals; and promoters with cAMP
response elements, which are inducible by cAMP. By using an inducible promoter, the
nucleic acid may be delivered to a cell and will remain quiescent until the addition of
the inducer. This allows further control on the timing of production of the therapeutic
gene.

Event-type specific promoters are active only upon the occurrence of an
event, such as tumorigenicity or viral infection. The HIV LTR is a well known
example of an event-specific promoter. The promoter is inactive unless the tat gene
product is present, which occurs upon viral infection.

Additionally, promoters that are coordinately regulated with a particular
cellular gene may be used. For example, promoters of genes that are coordinately
expressed when a particular VEGF receptor gene is expressed may be used. Then, the
nucleic acid will be transcribed when the VEGF receptor, such as VEGFR1, is
expressed, and not when VEGFR2 is expressed. This type of promoter is especially
useful when one knows the pattern of VEGF receptor expression in a particular tissue,
so that specific cells within that tissue may be killed upon transcription of a cytotoxic
agent gene without affecting the surrounding tissues.

Alternatively, cytocide gene products may be noncytotoxic but activate a
compound, which is endogenously produced or exogenously applied, from a nontoxic
form to a toxic product that inhibits protein synthesis.

The construct must contain the sequence that binds to the nucleic acid
binding domain, if the domain binds in a sequence specific manner. As described
below, the target nucleotide sequence may be contained within the coding region of the
cytocide, in which case no additional sequence need be incorporated. It may be
desirable to have multiple copies of the target sequence. If the target sequence is coding
sequence, the additional copies must be located in non-coding regions of the
cytocide-encoding agent. The target sequences of the nucleic acid binding domains are
generally known. In any case, the target sequence may be readily determined.
Techniques are generally available for establishing the target sequence (see, e.g., PCT
Application WO 92/05285 and U.S. Serial No. 586,769).
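
The placement of target sequence copies relative to the coding region can be checked with a simple search. The Python sketch below is illustrative only and rests on stated assumptions: the function names, the example construct, the example target sequence and the coding-region coordinates are all hypothetical, only one strand is searched, and positions are 0-based with a half-open coding interval.

```python
def find_target_sites(construct, target):
    """Return 0-based start positions of every occurrence of target (overlaps allowed)."""
    construct, target = construct.upper(), target.upper()
    sites = []
    start = construct.find(target)
    while start != -1:
        sites.append(start)
        start = construct.find(target, start + 1)
    return sites


def classify_sites(construct, target, coding_start, coding_end):
    """Split sites into those lying fully inside [coding_start, coding_end) and all others."""
    inside, outside = [], []
    for pos in find_target_sites(construct, target):
        if coding_start <= pos and pos + len(target) <= coding_end:
            inside.append(pos)
        else:
            outside.append(pos)
    return inside, outside


if __name__ == "__main__":
    # Made-up construct with two copies of a made-up target sequence.
    construct = "GGGCGGAAATTTGGGCGGCCC"
    coding, noncoding = classify_sites(construct, "GGGCGG", 9, 21)
    print("copies in coding region:", coding)
    print("copies outside coding region:", noncoding)
```

In this made-up example one copy lies within the assumed coding interval and one copy lies upstream of it.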

Specificity of delivery is achieved by coupling a nucleic acid binding
domain to a receptor-binding internalized ligand, either by chemical conjugation or by
constructing a fusion protein. Linkers as described above may be used. The
receptor-binding internalized ligand part confers specificity of delivery in a cell-specific
manner. The choice of the receptor-binding internalized ligand to use will depend upon the
receptor expressed by the target cells. The receptor type of the target cell population
may be determined by conventional techniques such as antibody staining, PCR of
cDNA using receptor-specific primers, and biochemical or functional receptor binding
assays. It is preferable that the receptor be cell type specific or have increased
expression or activity (i.e., higher rate of internalization) within the target cell
population.

The nucleic acid binding domain can be of two types: non-specific in its
ability to bind nucleic acid, or highly specific so that the amino acid residues bind only
the desired nucleic acid sequence. Nonspecific binding proteins, polypeptides, or
compounds are generally polycations or highly basic. Lys and Arg are the most basic
of the 20 common amino acids; proteins enriched for these residues are candidates for
nucleic acid binding domains. Examples of basic proteins include histones, protamines,
and repeating units of lysine and arginine. Poly-L-lysine is a well-used nucleic acid
binding domain (see U.S. Patent Nos. 5,166,320 and 5,354,844). Other polycations,
such as spermine and spermidine, may also be used to bind nucleic acids. By way of
example, sequence-specific proteins including Sp-1, AP-1, myoD and the rev gene
product from HIV may be used. Specific nucleic acid binding domains can be cloned
in tandem, individually, or multiply to a desired region of the receptor-binding
internalized ligand of interest. Alternatively, the domains can be chemically conjugated
to each other.

The corresponding response elements that bind sequence-specific
domains are incorporated into the construct to be delivered. Complexing the
cytocidal-encoding agent to the receptor-binding internalized ligand/nucleic acid binding
domain allows specific binding of the response element to the nucleic acid binding domain.
Even greater specificity of binding may be achieved by identifying and using the minimal
amino acid sequence that binds to the cytocidal-encoding agent of interest. For
example, phage display methods can be used to identify amino acid sequences of
varying length that will bind to specific nucleic acid sequences with high affinity. (See
U.S. Patent No. 5,223,409.) The peptide sequence can then be cloned into the
receptor-binding internalized ligand as a single copy or multiple copies. Alternatively, the
peptide may be chemically conjugated to the receptor-binding internalized ligand.
Incubation of the cytocide-encoding agent with the conjugated proteins will result in a
specific binding between the two.

These complexes may be used to deliver nucleic acids that encode
saporin or other cytocidal proteins into cells that have appropriate receptors that are
expressed, over-expressed or more active in internalization upon binding. The cytocide
gene is cloned downstream of a mammalian promoter such as SV40, CMV, TK or
Adenovirus promoter. As described above, promoters of interest may be active in any
cell type, active only in a tissue-specific manner, such as α-crystallin or tyrosinase,
event specific or inducible, such as the MMTV LTR.

Receptor-binding internalized ligands are prepared as discussed by any
suitable method, including recombinant DNA technology, isolation from a suitable
source, purchase from a commercial source, or chemical synthesis. The selected linker
or linkers is (are) linked to the receptor-binding internalized ligands by chemical
reaction, generally relying on an available thiol or amine group on the receptor-binding
internalized ligands. Heterobifunctional linkers are particularly suited for chemical
conjugation. Alternatively, if the linker is a peptide linker, then the receptor-binding
internalized ligands, linker and nucleic acid binding domain can be expressed
recombinantly as a fusion protein.

VEGF may be isolated from a suitable source or may be produced using
recombinant DNA methodology, discussed below. To effect chemical conjugation
herein, the growth factor protein is conjugated generally via a reactive amine group or
thiol group to the nucleic acid binding domain directly or through a linker to the nucleic
acid binding domain. The growth factor protein is conjugated either via its N-terminus,
C-terminus, or elsewhere in the polypeptide. In preferred embodiments, the growth
factor protein is conjugated via a reactive cysteine residue to the linker or to the nucleic
acid binding domain. The growth factor can also be modified by addition of a cysteine
residue, either by replacing a residue or by inserting the cysteine, at or near the amino
or carboxyl terminus, within about 20, preferably 10 residues from either end, and
preferably at or near the amino terminus.

In certain embodiments, the heterogeneity of preparations may be
reduced by mutagenizing the growth factor protein to replace reactive cysteines,
leaving, preferably, only one available cysteine for reaction. The growth factor protein
is modified by deleting or replacing a site(s) on the growth factor that causes the
heterogeneity. Such sites are typically cysteine residues that, upon folding of the
protein, remain available for interaction with other cysteines or for interaction with
more than one cytotoxic molecule per molecule of heparin-binding growth factor
peptide. Thus, such cysteine residues do not include any cysteine residues that are
required for proper folding of the growth factor or for retention of the ability to bind to
a growth factor receptor and internalize. For chemical conjugation, one cysteine
residue that, in physiological conditions, is available for interaction, is not replaced
because it is used as the site for linking the cytotoxic moiety. The resulting modified
heparin-binding growth factor is conjugated with a single species of cytotoxic
conjugate.

Alternatively, the contribution of each cysteine to the ability to bind to
VEGF receptors may be determined empirically. Each cysteine residue may be
systematically replaced with a conservative amino acid change (see Table 1, above) or
deleted. The resulting mutein is tested for the requisite biological activity: the ability to
bind to growth factor receptors and internalize linked nucleic acid binding domain and
agents. If the mutein retains this activity, then the cysteine residue is not required.
Additional cysteines are systematically deleted and replaced and the resulting muteins
are tested for activity. Each of the remaining cysteine residues may be systematically
deleted and/or replaced by a serine residue or other residue that would not be expected
to alter the structure of the protein. The resulting peptide is tested for biological
activity. If the cysteine residue is necessary for retention of biological activity it is not
deleted; if it is not necessary, then it is preferably replaced with a serine or other residue
that should not alter the secondary structure of the resulting protein. In this manner the
minimum number and identity of the cysteines needed to retain the ability to bind to a
heparin-binding growth factor receptor and internalize may be determined. It is noted,
however, that modified or mutant heparin-binding growth factors may exhibit reduced
or no proliferative activity, but may be suitable for use herein, if they retain the ability
to target a linked cytotoxic agent to cells bearing receptors to which the unmodified
heparin-binding growth factor binds and result in internalization of the cytotoxic
moiety. In the case of monomeric VEGF, VEGF121 contains 9 cysteines and each of
VEGF165, VEGF189 and VEGF206 contains 7 additional cysteine residues in the region
not present in VEGF121. Any of the 7 are likely to be non-essential for targeting and
internalization of linked cytotoxic agents. Recently, the role of Cys-25, Cys-56,
Cys-67, Cys-101, and Cys-145 in dimerization and biological activity was assessed
(Claffey et al., Biochim. Biophys. Acta 1246:1-9, 1995). Dimerization requires
Cys-25, Cys-56, and Cys-67. Substitution of any one of these cysteine residues
resulted in secretion of a monomeric VEGF, which was inactive in both vascular
permeability and endothelial cell mitotic assays. In contrast, substitution of Cys-145
had no effect on dimerization, although biological activities were somewhat reduced.
Substitution of Cys-101 did not result in the production of a secreted or cytoplasmic
protein. Thus, substitution of Cys-145 is preferred.
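
The systematic replacement of cysteine residues described above can be planned with a short enumeration of the single-substitution muteins to be made and tested. The Python sketch below is illustrative only: the function names and the example peptide are hypothetical, the example is not a VEGF sequence, and serine is used as the replacement residue by assumption. Activity testing of each mutein is an experimental step that the sketch does not model.

```python
def cysteine_positions(protein):
    """Return the 1-based positions of every cysteine (C) in a one-letter sequence."""
    return [i + 1 for i, residue in enumerate(protein.upper()) if residue == "C"]


def cys_to_ser_muteins(protein):
    """Map a label such as 'C3S' to the sequence with that single Cys replaced by Ser."""
    seq = protein.upper()
    muteins = {}
    for pos in cysteine_positions(seq):
        # Replace only the cysteine at this position; all other residues are unchanged.
        muteins["C%dS" % pos] = seq[:pos - 1] + "S" + seq[pos:]
    return muteins


if __name__ == "__main__":
    # Made-up peptide used only to show the enumeration.
    for label, sequence in cys_to_ser_muteins("MKCAACLL").items():
        print(label, sequence)
```

Each entry corresponds to one mutein that would then be tested for receptor binding and internalization as described above.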

The VEGF monomers are preferably linked via non-essential cysteine
residues to the linkers or to the targeted agent. VEGF that has been modified by
introduction of a Cys residue at or near one terminus, preferably the N-terminus, is
preferred for use in chemical conjugation (see Examples for preparation of such
modified VEGF). For use herein, preferably the VEGF is dimerized prior to linkage to
the linker and/or targeted agent. Methods for coupling proteins to the linkers, such as
the heterobifunctional agents, or to nucleic acids, or to proteins are known to those of
skill in the art and are also described herein.

Methods for chemical conjugation of proteins are known to those of skill
in the art. The preferred methods for chemical conjugation depend on the selected
components, but preferably rely on disulfide bond formation. For example, if the
targeted agent is SPDP-derivatized saporin, then it is advantageous to dimerize the
VEGF moiety prior to coupling or conjugating to the derivatized saporin. If VEGF is
modified to include a cysteine residue at or near the N-, preferably, or C-terminus, then
dimerization should follow coupling to the nucleic acid binding domain. To effect
chemical conjugation herein, the VEGF polypeptide is linked via one or more selected
linkers or directly to the nucleic acid binding domain.

A nucleic acid binding domain is prepared for chemical conjugation.
For chemical conjugation, a nucleic acid binding domain may be derivatized with
SPDP or other suitable chemicals. If the binding domain does not have a Cys residue
available for reaction, one can be either inserted or substituted for another amino acid.
If desired, mono-derivatized species may be isolated, essentially as described.

For chemical conjugation, the nucleic acid binding domain may be
derivatized or modified such that it includes a cysteine residue for conjugation to the
receptor-binding internalized ligand. Typically, derivatization proceeds by reaction
with SPDP. This results in a heterogeneous population. For example, nucleic acid
binding domain that is derivatized by SPDP to a level of 0.9 moles pyridine-disulfide
per mole of nucleic acid binding domain includes a population of non-derivatized,
mono-derivatized and di-derivatized SAP. Nucleic acid binding domain proteins that
are overly derivatized with SPDP may lose the ability to bind nucleic acid because of
reaction with sensitive lysines (Lambert et al., Cancer Treat. Res. 37:175-209, 1988).
The quantity of non-derivatized nucleic acid binding domain in the preparation of the
non-purified material can be difficult to judge, and this may lead to errors in
estimating the correct proportion of derivatized nucleic acid binding domain to add to
the reaction mixture.
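
Why an average derivatization level of 0.9 moles per mole yields a mixed population can be illustrated with a simple statistical estimate. The Python sketch below is offered only as an illustration and relies on an assumption that is not part of the description above: that each protein molecule acquires groups independently, so that the number of groups per molecule follows a Poisson distribution. Reactive lysines in a real protein differ in accessibility, so the actual distribution may differ. The function name is hypothetical.

```python
import math


def derivatization_fractions(mean_groups_per_protein, max_groups=2):
    """Poisson estimate of the fraction of molecules carrying 0..max_groups groups.

    Returns (fractions, remainder), where remainder is the fraction of molecules
    carrying more than max_groups groups.
    """
    m = mean_groups_per_protein
    fractions = [math.exp(-m) * m ** k / math.factorial(k) for k in range(max_groups + 1)]
    return fractions, 1.0 - sum(fractions)


if __name__ == "__main__":
    (non, mono, di), more = derivatization_fractions(0.9)
    print("non-derivatized:  %.3f" % non)
    print("mono-derivatized: %.3f" % mono)
    print("di-derivatized:   %.3f" % di)
    print("more than two:    %.3f" % more)
```

Under this assumption, a mean of 0.9 groups per molecule corresponds to roughly 41% non-derivatized, 37% mono-derivatized and 16% di-derivatized molecules, which is consistent with the heterogeneity noted above.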

Because of the removal of a negative charge by the reaction of SPDP
with lysine, the three species, however, have a charge difference. The methods herein
rely on this charge difference for purification of mono-derivatized nucleic acid binding
domain by Mono-S cation exchange chromatography. The use of purified
mono-derivatized nucleic acid binding domain has distinct advantages over the non-purified
material. The amount of receptor-binding internalized ligand that can react with
nucleic acid binding domain is limited to one molecule with the mono-derivatized
material, and it is seen in the results presented herein that a more homogeneous
conjugate is produced. There may still be sources of heterogeneity with the
mono-derivatized nucleic acid binding domain used here, but this is acceptable as long as
binding to the cytocide-encoding agent is not impacted.

Because more than one amino group on the nucleic acid binding domain
may react with the succinimidyl moiety, it is possible that more than one amino group
on the surface of the protein is reactive. This creates potential for heterogeneity in the
mono-derivatized nucleic acid binding domain. As an alternative to derivatizing to
introduce a sulfhydryl, the nucleic acid binding domain can be modified by the
introduction of a cysteine residue. Preferred loci for introduction of a cysteine residue
include the N-terminus region, preferably within about one to twenty residues from the
N-terminus of the nucleic acid binding domain. Using either methodology (reacting
mono-derivatized nucleic acid binding domain or introducing a Cys residue into nucleic
acid binding domain), the resulting preparations of chemical conjugates are
monogenous; compositions containing the conjugates also appear to be free of
aggregates. As a preferred alternative, heterogeneity can be avoided by producing a
fusion protein of receptor-binding internalized ligand and nucleic acid binding domain,
as described below.

Expression of DNA encoding a fusion of a receptor-binding internalized
ligand polypeptide linked to the nucleic acid binding domain results in a more
homogeneous preparation of cytotoxic conjugates. Aggregate formation can be reduced
in preparations containing the fusion proteins by modifying the receptor-binding
internalized ligand, such as by removal of nonessential cysteines, and/or the nucleic
acid binding domain to prevent interactions between conjugates via free cysteines.

DNA encoding the polypeptides may be isolated, synthesized or
obtained from commercial sources or prepared as described herein. Expression of
recombinant polypeptides may be performed as described herein; and DNA encoding
these polypeptides may be used as the starting materials for the methods herein.

DNA encoding VEGF is described above. DNA
may be prepared synthetically based on the amino acid or DNA sequence or may be
isolated using methods known to those of skill in the art, such as PCR, probe
hybridization of libraries, and the like, or obtained from commercial or other sources.

As described herein, such DNA may then be mutagenized using standard
methodologies to delete or replace any cysteine residues that are responsible for
aggregate formation. If necessary, the identity of cysteine residues that contribute to
aggregate formation may be determined empirically, by deleting and/or replacing a
cysteine residue and ascertaining whether the resulting growth factor with the deleted
cysteine forms aggregates in solutions containing physiologically acceptable buffers
and salts. Loci for insertion of cysteine residues may also be determined empirically.
Generally, regions at or near (within 20, preferably 10 amino acids) the C- or,
preferably, the N-terminus are preferred.

The DNA construct encoding the fusion protein can be inserted into a
plasmid and expressed in a selected host, as described above, to produce a recombinant
receptor-binding internalized ligand/nucleic acid binding domain conjugate. Multiple
copies of the chimera can be inserted into a single plasmid in operative linkage with
one promoter. When expressed, the resulting protein will then be a multimer.
Typically, two to six copies of the chimera are inserted, preferably in a head to tail
fashion, into one plasmid.

To produce monogenous preparations of fusion protein, DNA encoding VEGF is
modified so that, upon expression, the resulting VEGF portion of the fusion protein
does not include any cysteines available for reaction. In preferred embodiments, DNA
encoding a VEGF polypeptide is linked to DNA encoding a nucleic acid binding
domain. The DNA encoding the VEGF polypeptide or other receptor-binding
internalized ligand is modified in order to remove the translation stop codon and other
transcriptional or translational stop signals that may be present and to remove or replace
DNA encoding the available cysteines. The DNA is then ligated to the DNA encoding
the nucleic acid binding domain polypeptide directly or via a linker region of one or
more codons between the first codon of the nucleic acid binding domain and the last
codon of the VEGF. The size of the linker region may be any length as long as the
resulting conjugate binds and is internalized by a target cell. Presently, spacer regions
of from about one to about seventy-five to ninety codons are preferred. The order of
the receptor-binding internalized ligand and nucleic acid binding domain in the fusion
protein may be reversed. If the nucleic acid binding domain is N-terminal, then it is
modified to remove the stop codon and any stop signals.
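
The assembly order described above (remove the stop codon from the N-terminal partner, then add an optional in-frame linker, then the C-terminal partner) can be sketched as follows. This is a minimal illustration under stated assumptions: the two short open reading frames are hypothetical toy sequences rather than VEGF or any real binding domain, the function names are invented for the example, and the linker is the Gly4Ser codon run that appears in SEQ ID NO. 40.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(orf):
    """Split an open reading frame into triplets, insisting on a complete frame."""
    if len(orf) % 3 != 0:
        raise ValueError("open reading frame length must be a multiple of 3")
    return [orf[i:i + 3] for i in range(0, len(orf), 3)]


def strip_terminal_stop(orf):
    """Remove a single terminal stop codon so translation can read through."""
    triplets = codons(orf.upper())
    if triplets and triplets[-1] in STOP_CODONS:
        triplets = triplets[:-1]
    return "".join(triplets)


def build_fusion(n_terminal_orf, linker, c_terminal_orf):
    """Join N-terminal partner, linker and C-terminal partner in one frame."""
    fusion = strip_terminal_stop(n_terminal_orf) + linker.upper() + c_terminal_orf.upper()
    # Any stop codon before the final triplet would truncate the fusion protein.
    if any(c in STOP_CODONS for c in codons(fusion)[:-1]):
        raise ValueError("in-frame stop codon inside the fusion")
    return fusion


if __name__ == "__main__":
    toy_ligand = "ATGGCACCCTGA"      # hypothetical ORF: Met-Ala-Pro-stop
    gly4ser = "GGCGGCGGCGGCTCT"      # Gly4Ser codons as in SEQ ID NO. 40
    toy_domain = "AAAAAGAAATAA"      # hypothetical ORF: Lys-Lys-Lys-stop
    print(build_fusion(toy_ligand, gly4ser, toy_domain))
```

If the order of the partners is reversed, the same function applies with the arguments swapped, since the stop codon is always removed from whichever partner is N-terminal.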

If the VEGF or other ligand has been modified so as to lack mitogenic
activity or other biological activities, binding and internalization may still be readily
assayed by any one of the following tests or other equivalent tests. Generally, these
tests involve labeling the ligand, incubating it with target cells, and visualizing or
measuring intracellular label. For example, briefly, VEGF may be fluorescently labeled
with FITC or radiolabeled with 125I. Fluorescein-conjugated VEGF is incubated with
cells and examined by fluorescence microscopy or confocal
microscopy for internalization. When VEGF is labeled with 125I, the labeled VEGF is
incubated with cells at 4°C. Cells are temperature shifted to 37°C and washed with
2 M NaCl at low pH to remove any cell-bound VEGF. Label is then counted,
thereby measuring internalization of VEGF. Alternatively, the ligand can be
conjugated with a nucleic acid binding domain by any of the methods described herein
and complexed with a plasmid encoding saporin. As discussed below, the complex
may be used to transfect cells and cytotoxicity measured.

The DNA encoding the resulting receptor-binding internalized ligand/
nucleic acid binding domain can be inserted into a plasmid and expressed in a selected
host, as described above, to produce a monogenous preparation.

Multiple copies of the modified receptor-binding internalized
ligand/nucleic acid binding domain chimera can be inserted into a single plasmid in
operative linkage with one promoter. When expressed, the resulting protein will be a
multimer. Typically two to six copies of the chimera are inserted, preferably in a head
to tail fashion, into one plasmid. Merely by way of example, DNA encoding human
bFGF has been mutagenized using splicing by overlap extension (SOE). Each
application of the SOE method uses two amplified oligonucleotide products, which
have complementary ends as primers and which include an altered codon at the locus at
which the mutation is desired, to produce a hybrid product. A second amplification
reaction that uses two primers that anneal at the non-overlapping ends amplifies the
hybrid to produce DNA that has the desired alteration.
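
The logic of SOE, in which two first-round products that share a mutated overlap are fused into one full-length product, can be sketched in a simplified form. The sketch works on the coding strand only and ignores primer annealing, strand complementarity and polymerase extension; the template is a hypothetical toy sequence, and the function names and the nine-nucleotide flank are assumptions chosen for illustration.

```python
def soe_fragments(template, codon_index, new_codon, flank=9):
    """Return the two first-round products, both carrying the altered codon.

    The left product ends `flank` nucleotides after the altered codon and the
    right product starts `flank` nucleotides before it, so they overlap.
    """
    pos = codon_index * 3
    mutated = template[:pos] + new_codon + template[pos + 3:]
    left = mutated[:pos + 3 + flank]
    right = mutated[max(0, pos - flank):]
    return left, right


def fuse_by_overlap(left, right, min_overlap=12):
    """Join two fragments where the end of `left` matches the start of `right`."""
    for size in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:size]):
            return left + right[size:]
    raise ValueError("fragments do not share a sufficient overlap")


if __name__ == "__main__":
    # Hypothetical 15-codon template; codon 7 (TGT, Cys) is changed to TCT (Ser).
    toy_template = "ATGGCTGAAGGTCACAAACTGTGTCCAGATCGTTACAAGGGCTAA"
    left, right = soe_fragments(toy_template, codon_index=7, new_codon="TCT")
    print(fuse_by_overlap(left, right))
```

The fused product has the same length as the template and differs from it only at the altered codon, which corresponds to the hybrid that the second amplification reaction then amplifies.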

The receptor-binding internalized ligand/nucleic acid binding domain is
incubated with the cytocide-encoding agent, typically a DNA molecule, to be delivered
under conditions that allow binding of the nucleic acid binding domain to the agent.
Conditions will vary somewhat depending on the nature of the nucleic acid binding
domain, but binding will typically occur in 0.1 M NaCl and 20 mM HEPES or other
similar buffer.

The desired application is the delivery of cytocidal agents, such as
saporin, in a non-toxic form. By delivering a nucleic acid molecule capable of
expressing saporin, the timing of cytotoxicity may be exquisitely controlled. For
example, if saporin is expressed under the control of a tissue-specific promoter, then
uptake of the complex by cells having the tissue-specific factors necessary for promoter
activation will result in the killing of those cells. On the other hand, if cells taking up
the complex do not have those tissue-specific factors, the cells will be spared.

Merely by way of example, test constructs have been made and tested.
One construct is a chemical conjugate of bFGF and poly-L-lysine. The bFGF molecule
is a variant in which the Cys residue at position 96 has been changed to a serine; thus,
only the Cys at position 78 is available for conjugation. This bFGF is called FGF2-3.
The poly-L-lysine was derivatized with SPDP and coupled to FGF2-3. This
FGF2-3/poly-L-lysine conjugate was used to deliver a plasmid able to express the
β-galactosidase gene.

The ability of a construct to bind nucleic acid molecules may be
conveniently assessed by agarose gel electrophoresis. Briefly, a plasmid, such as pSVβ,
is digested with restriction enzymes to yield a variety of fragment sizes. For ease of
detection, the fragments may be labeled with 32P either by filling in of the ends with
DNA polymerase I or by phosphorylation of the 5'-end with polynucleotide kinase
following dephosphorylation by alkaline phosphatase. The plasmid fragments are then
incubated with the receptor-binding internalized ligand/nucleic acid binding domain (in
this case, FGF2-3/poly-L-lysine) in a buffered saline solution, such as 20 mM HEPES,
pH 7.3, 0.1 M NaCl. The reaction mixture is electrophoresed on an agarose gel
alongside similarly digested, but nonreacted fragments. If a radioactive label was
incorporated, the gel may be dried and autoradiographed. If no radioactive label is
present, the gel may be stained with ethidium bromide and the DNA visualized through
appropriate red filters after excitation with UV. Binding has occurred if the mobility of
the fragments is retarded compared to the control. In the example case, the mobility of
the fragments was retarded after binding with the FGF2-3/poly-L-lysine conjugate.

Further testing of the conjugate is performed to show that it binds to the
cell surface receptor and is internalized into the cell. It is not necessary that the
receptor-binding internalized ligand part of the conjugate retain complete biological
activity. For example, VEGF is mitogenic on certain cell types. As discussed above,
this activity may not always be desirable. If this activity is present, a proliferation
assay is performed. Likewise, for each desirable activity, an appropriate assay may be
performed. However, for application of the subject invention, the only criteria that
need be met are receptor binding and internalization.

Receptor binding and internalization may be measured by the following
three assays. (1) A competitive inhibition assay of the complex to cells expressing the
appropriate receptor demonstrates receptor binding. (2) Receptor binding and
internalization may be assayed by measuring β-gal expression (e.g., enzymatic activity)
in cells that have been transformed with a complex of a β-gal containing plasmid
condensed with a receptor-binding internalized ligand/nucleic acid binding domain.
This assay is particularly useful for optimizing conditions to give maximal
transformation. Thus, the optimum ratio of receptor-binding internalized ligand/nucleic
acid binding domain to nucleic acid and the amount of DNA per cell may readily be
determined by assaying and comparing the enzymatic activity of β-gal. As such, these
first two assays are useful for preliminary analysis and failure to show receptor binding
or β-gal activity does not per se eliminate a candidate receptor-binding internalized
ligand/nucleic acid binding domain conjugate or fusion protein from further analysis.
(3) The preferred assay is a cytotoxicity assay performed on cells transformed with a
cytocide-encoding agent bound by receptor-binding internalized ligand/nucleic acid
binding domain. While, in general, any cytocidal molecule may be used,
ribosome-inactivating proteins are preferred and saporin, or another type I
ribosome-inactivating protein, is particularly preferred. A statistically significant
reduction in cell number demonstrates the ability of the receptor-binding internalized
ligand/nucleic acid binding domain conjugate or fusion to deliver nucleic acids into a cell.

C. Other elements

1. Nuclear translocation signals

As used herein, a nuclear translocation or targeting sequence (NTS) is a
sequence of amino acids in a protein that is required for translocation of the protein
into a cell nucleus. Examples of NTSs are set forth in Table 2, below. Comparison with
known NTSs, and if necessary testing of candidate sequences, should permit those of
skill in the art to readily identify other amino acid sequences that function as NTSs.

As used herein, heterologous NTS refers to an NTS that is different from
the NTS that occurs in the wild-type peptide, polypeptide, or protein. For example, the
NTS may be derived from another polypeptide, it may be synthesized, or it may be
derived from another region in the same polypeptide. A typical consensus NTS
sequence contains an amino-terminal proline or glycine followed by at least three basic
residues in an array of seven to nine amino acids (see, e.g., Dang et al. (1989) J. Biol.
Chem. 264:18019-18023; Dang et al. (1988) Mol. Cell. Biol. 8:4049-4058; and Table 2,
which sets forth examples of NTSs and regions of proteins that share homology with
known NTSs).
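
The consensus just stated (an amino-terminal proline or glycine followed by at least three basic residues within an array of seven to nine amino acids) lends itself to a simple sequence scan. The sketch below is one possible reading of that rule, not a validated predictor: it uses one-letter amino acid codes, counts only Lys and Arg as basic, and its function names are hypothetical.

```python
BASIC_RESIDUES = set("KR")  # assumption: Lys and Arg only; His is not counted


def matches_consensus(window):
    """True if a 7-9 residue window starts with Pro/Gly and has >= 3 basic residues."""
    return (
        7 <= len(window) <= 9
        and window[0] in "PG"
        and sum(res in BASIC_RESIDUES for res in window[1:]) >= 3
    )


def find_candidate_nts(sequence):
    """Return (start, window) for the shortest matching window at each position."""
    seq = sequence.upper()
    hits = []
    for start in range(len(seq)):
        for length in (7, 8, 9):
            window = seq[start:start + length]
            if len(window) == length and matches_consensus(window):
                hits.append((start, window))
                break
    return hits


if __name__ == "__main__":
    # PKKRKVE is the SV40 large T entry of Table 2 written in one-letter code.
    print(find_candidate_nts("MAAPKKRKVEDAA"))
```

Several Table 2 entries, such as the FGF and VEGF sequences, do not begin with proline or glycine, so a scan of this kind serves only as a first filter before candidate sequences are tested.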

TABLE 2

SOURCE            SEQUENCE                                              SEQ ID NO.

SV40 large T      Pro[?]LysLysArgLysValGlu                              90
Polyoma large T   Pro[?]ProLysLysAlaArgGluVal                           91
Human c-Myc       Pro[?]AlaAlaLysArgValLysLeuAsp                        92
Adenovirus E1A    Lys[?]ArgProArgPro                                    93
Yeast mat α2      Lys[?]IleProIleLys                                    94
c-Erb-A           A. Gly[?]LysArgLysArgLysSer                           95
                  B. Ser[?]LysArgValAlaLysArgLysLeu                     96
                  C. Ser[?]HisTrpLysGlnLysArgLysPhe                     97
c-Myb             Pro[?]LeuLeuLysLysIleLysGln                           98
p53               Pro[?]GlnProLysLysLysPro                              99
Nucleolin         Pro[?]GlyLysArgLysLysGluMetThrLysGlnLysGluValPro      100
HIV Tat           Gly[?]ArgLysLysArgArgGlnArgArgArgAlaPro               101
FGF-1             AsnTyrLysLysProLysLeu                                 102
FGF-2             HisPheLysAspProLysArg                                 103
FGF-3             AlaProArgArgArgLysLeu                                 72
FGF-4             IleLysArgLeuArgArg                                    75
FGF-5             GlyArgArg
FGF-6             IleLysArgGlnArgArg                                    76
FGF-7             IleArgValArgArg                                       84
VEGF189           LysArgLysArgLysLys (in EXON VI)                       85
VEGF206           LysArgLysArgLysLys (in EXON VI)                       85
PDGF              ProLysGlyLysHisArgLysPheLysHisThr

Superscript indicates position in protein ([?] marks a superscript that is illegible)

2. Cytoplasm-translocation signal

A cytoplasm-translocation signal sequence is a sequence of amino acids in a
protein that causes retention of proteins in the lumen of the endoplasmic reticulum
and/or translocates proteins to the cytosol. The signal sequence in mammalian cells is
KDEL (Lys-Asp-Glu-Leu) (Munro and Pelham, Cell 48:899-907, 1987). Some
modifications of this sequence have been made without loss of activity. For example,
the sequences RDEL (Arg-Asp-Glu-Leu) and KEEL (Lys-Glu-Glu-Leu) confer
efficient or partial retention, respectively, in plants (Denecke et al., EMBO J. 11:2345-2355,
1992).

A cytoplasm-translocation signal sequence may be included in saporin or, for
conjugates of VEGF with a nucleic acid binding domain, the sequence may reside in
either part or both. If cleavable linkers are used in the conjugate, the
cytoplasm-translocation signal is preferably included in saporin or the nucleic acid binding
domain. Additionally, a cytoplasm-translocation signal sequence may be included in
VEGF, as long as it is placed so as not to interfere with receptor binding.
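
When designing a construct, it can be useful to confirm that one of the motifs named above (KDEL, RDEL or KEEL) is actually present on the encoded protein. The sketch below checks only the last four residues; that the motif must sit at the extreme C-terminus is an assumption of this example, as are the function name and the toy sequences.

```python
RETENTION_MOTIFS = {
    "KDEL": "Lys-Asp-Glu-Leu",
    "RDEL": "Arg-Asp-Glu-Leu",
    "KEEL": "Lys-Glu-Glu-Leu",
}


def c_terminal_retention_motif(protein):
    """Return the C-terminal tetrapeptide if it is a listed motif, else None."""
    tail = protein.upper()[-4:]
    return tail if tail in RETENTION_MOTIFS else None


if __name__ == "__main__":
    for toy_protein in ("MSTAKDEL", "MSTARDEL", "MSTAKDELA"):  # hypothetical sequences
        print(toy_protein, c_terminal_retention_motif(toy_protein))
```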

In addition, or alternatively, membrane-disruptive peptides may be incorporated
into complexes of VEGF-nucleic acid binding domain and cytocide-encoding agent.
Adenoviruses are known to enhance disruption of endosomes. Virus-free viral proteins,
such as influenza virus hemagglutinin HA-2, may be useful in the present invention.
Other proteins may be tested in the assays described herein to find specific endosome
disrupting agents that enhance gene delivery. In general, these proteins and peptides
are amphipathic (see, Wagner et al., Adv. Drug Del. Rev. 14:113-135, 1994).

3. Linkers

A linker is a peptide or other molecule that couples a VEGF polypeptide
to the targeted agent. The linker may be bound via the N- or C-terminus or an internal
residue, but typically within about 20 amino acids of either terminus of a VEGF and/or
targeted agent. The linkers provided herein increase intracellular availability, serum
stability, specificity and solubility of the conjugate or provide increased flexibility or
relieve steric hindrance in the conjugate. For example, specificity or intracellular
availability of the targeted agent may be conferred by including a linker that is a
substrate for certain proteases, such as a protease that is present in only certain
subcellular compartments or that is present at higher levels in tumor cells than normal
cells.

In order to increase the serum stability, solubility and/or intracellular
concentration and to reduce steric hindrance caused by close proximity of VEGF and
the targeted agent, one or more linkers is (are) inserted between the VEGF protein and
the targeted moiety. These linkers include peptide linkers, such as intracellular protease
substrates and peptides that increase flexibility or solubility of the linked moieties, and
chemical linkers, such as acid labile linkers, ribozyme substrate linkers and others.
Peptide linkers may be inserted using heterobifunctional reagents, described below, or,
preferably, are linked to VEGF by linking DNA encoding the substrate to the DNA
encoding the VEGF protein and expressing the resulting chimera. In instances in which
the targeted agent is a protein, such as a RIP, the DNA encoding the linker can be
inserted between the DNA encoding the VEGF protein and the DNA encoding the
targeted protein agent.

Chemical linkers may be inserted by covalently coupling the linker to
the VEGF protein and the targeted agent. The heterobifunctional agents, described
below, may be used to effect such covalent coupling.

a. Protease substrates

Peptides encoding protease-specific substrates are introduced between
the VEGF protein and the targeted moiety. The peptides may be inserted using
heterobifunctional reagents, described below, or, preferably, are linked to VEGF by
linking DNA encoding the substrate to the DNA encoding the VEGF protein and
expressing the resulting chimera. In instances in which the targeted agent is a protein,
such as a RIP, the DNA encoding the linker can be inserted between the DNA encoding
the VEGF protein and the DNA encoding the targeted protein agent. For example,
DNA encoding substrates specific for intracellular proteases has been inserted between
the DNA encoding the VEGF protein and a targeted agent, such as saporin.

Any protease specific substrate (see, e.g., O'Hare et al. (1990) FEBS
275:200-204; Forsberg et al. (1991) J. Protein Chem. 10:517-526; Westby et al. (1992)
Bioconjugate Chem. 3:375-381) may be introduced as a linker between the VEGF
polypeptide and linked targeting agent as long as the substrate is cleaved in an
intracellular compartment. Preferred substrates include those that are specific for
proteases that are expressed at higher levels in tumor cells or that are preferentially
expressed in the endosome. The following substrates are among those contemplated
for use in accord with the methods herein: cathepsin B substrate, cathepsin D substrate,
trypsin substrate, thrombin substrate, and recombinant subtilisin substrate
(XaaAspGluLeu, SEQ ID NO. 50, particularly, PheAlaHisTyr, SEQ ID NO. 49).

b. Flexible linkers and linkers that increase the solubility of the
conjugates

Flexible linkers and linkers that increase solubility of the conjugates are
contemplated for use, either alone or with other linkers, such as the protease specific
substrate linkers. Such linkers include, but are not limited to, (Gly4Ser)n, (Ser4Gly)n
and (AlaAlaProAla)n (see, SEQ ID NO. 48) in which n is 1 to 6, preferably 1-4, such
as:

(1) Gly4Ser SEQ ID NO. 40
CCATGGGCGG CGGCGGCTCT GCCATGG

(2) (Gly4Ser)2 SEQ ID NO. 41
CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCCAT GG

(3) (Ser4Gly)4 SEQ ID NO. 42
CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCTC GTCGTCGTCG
GGCTCGTCGT CGTCGGGCGC CATGG

(4) (Ser4Gly)2 SEQ ID NO. 43
CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCGC CATGG

(5) (AlaAlaProAla)n, where n is 1 to 4,
preferably 2 (see, SEQ ID NO. 48)

The linker Gly4Ser (SEQ ID NO. 40) is preferred for VEGF-VEGF
conjugates. The linker Ala-Met is preferred for SAP-VEGF chemical conjugates, and
no linker is preferred for SAP-VEGF fusion proteins. In general, a linker length of 1 is
preferred for conferring stability on the conjugates.
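
The linker DNA sequences listed above can be checked by translating them. The sketch below is an illustration under one stated assumption: the reading frame is taken to start at the ATG within the leading CCATGG (NcoI) site, which this section does not state explicitly. The codon table is the standard genetic code; the function name is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_from_first_atg(dna):
    """Translate from the first ATG, ignoring spaces and any trailing partial codon."""
    seq = dna.replace(" ", "").upper()
    start = seq.find("ATG")
    if start < 0:
        raise ValueError("no ATG found")
    return "".join(
        CODON_TABLE[seq[i:i + 3]] for i in range(start, len(seq) - 2, 3)
    )


if __name__ == "__main__":
    linkers = {
        "SEQ ID NO. 40": "CCATGGGCGG CGGCGGCTCT GCCATGG",
        "SEQ ID NO. 41": "CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCCAT GG",
        "SEQ ID NO. 43": "CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCGC CATGG",
    }
    for name, dna in linkers.items():
        print(name, translate_from_first_atg(dna))
```

Read in that frame, SEQ ID NO. 40 encodes Met-(Gly4Ser)-Ala-Met and SEQ ID NO. 41 encodes Met-(Gly4Ser)2-Ala-Met, so each repeat unit is flanked by residues contributed by the two restriction sites.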

c. Heterobifunctional cross-linking reagents

Numerous heterobifunctional cross-linking reagents that are used to
form covalent bonds between amino groups and thiol groups and to introduce thiol
groups into proteins are known to those of skill in this art (see, e.g., the Pierce Catalog,
ImmunoTechnology Catalog & Handbook, 1992-1993, which describes the preparation
of and use of such reagents and provides a commercial source for such reagents; see,
also, e.g., Cumber et al. (1992) Bioconjugate Chem. 3:397-401; Thorpe et al. (1987)
Cancer Res. 47:5924-5931; Gordon et al. (1987) Proc. Natl. Acad. Sci. 84:308-312;
Walden et al. (1986) J. Mol. Cell Immunol. 2:191-197; Carlsson et al. (1978) Biochem.
J. 173:723-737; Mahan et al. (1987) Anal. Biochem. 162:163-170; Wawrzynczak et
al. (1992) Br. J. Cancer 66:361-366; Fattom et al. (1992) Infection & Immun.
60:584-589). These reagents may be used to form covalent bonds between the VEGF
polypeptide(s) with protease substrate peptide linkers and targeted protein agent. These
reagents include, but are not limited to: N-succinimidyl-3-(2-pyridyldithio)propionate
(SPDP; disulfide linker); sulfosuccinimidyl
6-[3-(2-pyridyldithio)propionamido]hexanoate (sulfo-LC-SPDP);
succinimidyloxycarbonyl-α-methyl benzyl thiosulfate (SMBT, hindered disulfate linker);
succinimidyl 6-[3-(2-pyridyldithio)propionamido]hexanoate (LC-SPDP);
sulfosuccinimidyl 4-(N-maleimidomethyl)cyclohexane-1-carboxylate (sulfo-SMCC);
succinimidyl 3-(2-pyridyldithio)butyrate (SPDB; hindered disulfide bond linker);
sulfosuccinimidyl 2-(7-azido-4-methylcoumarin-3-acetamide)ethyl-1,3'-dithiopropionate
(SAED); sulfosuccinimidyl 7-azido-4-methylcoumarin-3-acetate (SAMCA);
sulfosuccinimidyl 6-[α-methyl-α-(2-pyridyldithio)toluamido]hexanoate (sulfo-LC-SMPT);
1,4-di-[3'-(2-pyridyldithio)propionamido]butane (DPDPB);
4-succinimidyloxycarbonyl-α-methyl-α-(2-pyridylthio)toluene (SMPT, hindered
disulfate linker); m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS);
m-maleimidobenzoyl-N-hydroxysulfosuccinimide ester (sulfo-MBS);
N-succinimidyl(4-iodoacetyl)aminobenzoate (SIAB; thioether linker);
sulfosuccinimidyl(4-iodoacetyl)aminobenzoate (sulfo-SIAB);
succinimidyl 4-(p-maleimidophenyl)butyrate (SMPB);
sulfosuccinimidyl 4-(p-maleimidophenyl)butyrate (sulfo-SMPB); and
azidobenzoyl hydrazide (ABH). These linkers should be particularly useful when used
in combination with peptide linkers, such as those that increase flexibility.

d. Acid cleavable, photocleavable and heat sensitive linkers

Acid cleavable linkers include, but are not limited to,
bismaleimidoethoxy propane and adipic acid dihydrazide linkers (see, e.g., Fattom et
al. (1992) Infection & Immun. 60:584-589) and acid labile transferrin conjugates that
contain a sufficient portion of transferrin to permit entry into the intracellular transferrin
cycling pathway (see, e.g., Welhöner et al. (1991) J. Biol. Chem. 266:4309-4314).
Conjugates linked via acid cleavable linkers should be preferentially cleaved in acidic
intracellular compartments, such as the endosome.

Photocleavable linkers are linkers that are cleaved upon exposure to light
(see, e.g., Goldmacher et al. (1992) Bioconj. Chem. 3:104-107, which linkers are herein
incorporated by reference), thereby releasing the targeted agent upon exposure to light.
Photocleavable linkers that are cleaved upon exposure to light are known
(see, e.g., Hazum et al. (1981) in Pept., Proc. Eur. Pept. Symp., 16th, Brunfeldt, K (Ed),
pp. 105-110, which describes the use of a nitrobenzyl group as a photocleavable
protective group for cysteine; Yen et al. (1989) Makromol. Chem. 190:69-82, which
describes water soluble photocleavable copolymers, including
hydroxypropylmethacrylamide copolymer, glycine copolymer, fluorescein copolymer
and methylrhodamine copolymer; Goldmacher et al. (1992) Bioconj. Chem. 3:104-107,
which describes a cross-linker and reagent therefor that undergoes photolytic
degradation upon exposure to near UV light (350 nm); and Senter et al. (1985)
Photochem. Photobiol. 42:231-237, which describes nitrobenzyloxycarbonyl chloride
cross linking reagents that produce photocleavable linkages), thereby releasing the
targeted agent upon exposure to light. Such linkers would have particular use in
treating dermatological or ophthalmic conditions that can be exposed to light using
fiber optics. After administration of the conjugate, the eye or skin or other body part
can be exposed to light, resulting in release of the targeted moiety from the conjugate.
If the toxic moiety is a light activated porphyrin, light-exposure will also activate the
porphyrin, thereby causing cell death. Use of photocleavable linkers should permit
administration of higher dosages of such conjugates compared to conjugates that
release a cytotoxic agent upon internalization. Heat sensitive linkers would also have
similar applicability.

D. Expression vectors and host cells for expression of VEGF or targeted
agents

DNA encoding the desired VEGF polypeptide, targeted agent or VEGF
conjugate is inserted into a suitable vector and expressed in a suitable prokaryotic or
eukaryotic host. Numerous suitable hosts and vectors are known and available to those
of skill in this art and may be purchased commercially or constructed according to
published protocols using well known and available starting materials. Suitable
eukaryotic host cells include insect cells, yeast cells, and animal cells. Suitable
prokaryotic host cells include E. coli, strains of Bacillus and Streptomyces. E. coli is a
preferred host cell.

The DNA construct is introduced into a plasmid for expression in a
desired host. In preferred embodiments, the host is a bacterial host. The sequences of
nucleotides in the plasmids that are regulatory regions, such as promoters and operators,
are operationally associated with one another for transcription. The sequence of
nucleotides encoding the growth factor or growth factor-chimera may also include
DNA encoding a secretion signal, whereby the resulting peptide is a precursor protein.
The resulting processed protein may be recovered from the periplasmic space or the
fermentation medium.

In preferred embodiments the DNA plasmids also include a transcription
terminator sequence. The promoter regions and transcription terminators are each
independently selected from the same or different genes.

The plasmids used herein include a promoter in operable association
with the DNA encoding the protein or polypeptide of interest and are designed for
expression of proteins in a bacterial host. It has been found that tightly regulated
promoters are preferred for expression of saporin. Suitable promoters for expression of
proteins and polypeptides herein are widely available and are well known in the art.
Inducible promoters or constitutive promoters that are linked to regulatory regions are
preferred. Examples of suitable inducible promoters and promoter regions include, but
are not limited to: the E. coli lac operator responsive to isopropyl
β-D-thiogalactopyranoside (IPTG; see, e.g., Nakamura et al. (1979) Cell
18:1109-1117); the metallothionein promoter metal-regulatory-elements responsive to
heavy-metal (e.g., zinc) induction (see, e.g., U.S. Patent No. 4,870,009 to Evans et al.); the




phage T7lac promoter responsive to IPTG (see, e.g., U.S. Patent No. 4,952,496; and
Studier et al. (1990) Meth. Enzymol. 185:60-89) and the tac promoter. Other promoters
include, but are not limited to, the T7 phage promoter and other T7-like phage
promoters, such as the T3, T5 and SP6 promoters, the trp, lpp, and lac promoters, such
as the lacUV5, from E. coli; the P10 or polyhedrin gene promoter of baculovirus/insect
cell expression systems (see, e.g., U.S. Patent Nos. 5,243,041, 5,242,687, 5,266,317,
4,745,051, and 5,169,784) and inducible promoters from other eukaryotic expression
systems.

Preferred promoter regions are those that are inducible and functional in
E. coli. Examples of suitable inducible promoters and promoter regions include, but
are not limited to: the E. coli lac operator responsive to isopropyl
β-D-thiogalactopyranoside (IPTG; see, e.g., Nakamura et al., Cell 18:1109-1117, 1979);
the metallothionein promoter metal-regulatory-elements responsive to heavy-metal
(e.g., zinc) induction (see, e.g., U.S. Patent No. 4,870,009 to Evans et al.); the phage
T7lac promoter responsive to IPTG (see, e.g., U.S. Patent No. 4,952,496; and Studier et
al., Meth. Enzymol. 185:60-89, 1990) and the tac promoter.

The plasmids also preferably include a selectable marker gene or genes
that are functional in the host. A selectable marker gene includes any gene that confers
a phenotype on bacteria that allows transformed bacterial cells to be identified and
selectively grown from among a vast majority of untransformed cells. Suitable
selectable marker genes for bacterial hosts, for example, include the ampicillin
resistance gene (Amp^r), tetracycline resistance gene (Tc^r) and the kanamycin resistance
gene (Kan^r). The kanamycin resistance gene is presently preferred.

The plasmids may also include DNA encoding a signal for secretion of
the operably linked protein. Secretion signals suitable for use are widely available and
are well known in the art. Prokaryotic and eukaryotic secretion signals functional in
E. coli may be employed. The presently preferred secretion signals include, but are not
limited to, those encoded by the following E. coli genes: ompA, ompT, ompF, ompC,
beta-lactamase, and alkaline phosphatase, and the like (von Heijne, J. Mol. Biol.
184:99-105, 1985). In addition, the bacterial pelB gene secretion signal (Lei et al., J.
Bacteriol. 169:4379, 1987), the phoA secretion signal, and the cek2 functional in insect
cell may be employed. The most preferred secretion signal is the E. coli ompA
secretion signal. Other prokaryotic and eukaryotic secretion signals known to those of
skill in the art may also be employed (see, e.g., von Heijne, J. Mol. Biol. 184:99-105,
1985). Using the methods described herein, one of skill in the art can substitute
secretion signals that are functional in either yeast, insect or mammalian cells to secrete
proteins from those cells.

Preferred plasmids for transformation of E. coli cells include the pET
expression vectors (see, U.S. Patent 4,952,496; available from Novagen, Madison, WI;
see, also, literature published by Novagen describing the system). Such plasmids
include pET 11a, which contains the T7lac promoter, T7 terminator, the inducible
E. coli lac operator, and the lac repressor gene; pET 12a-c, which contains the T7
promoter, T7 terminator, and the E. coli ompT secretion signal; and pET 15b (Novagen,
Madison, WI), which contains a His-Tag™ leader sequence (Seq. ID No. 36) for use
in purification with a His column and a thrombin cleavage site that permits cleavage
following purification over the column; the T7-lac promoter region and the T7
terminator.

Other preferred plasmids include the pKK plasmids, particularly pKK
223-3, which contains the tac promoter (available from Pharmacia; see, also, Brosius et
al., Proc. Natl. Acad. Sci. 81:6929, 1984; Ausubel et al., Current Protocols in
Molecular Biology; U.S. Patent Nos. 5,122,463, 5,173,403, 5,187,153, 5,204,254,
5,212,058, 5,212,286, 5,215,907, 5,220,013, 5,223,483, and 5,229,279). Plasmid pKK
has been modified by replacement of the ampicillin resistance marker gene: the plasmid
is digested with EcoRI, and a kanamycin resistance cassette with EcoRI sticky ends
(purchased from Pharmacia; obtained from pUC4K, see, e.g., Vieira et al., Gene
19:259-268, 1982; and U.S. Patent No. 4,719,179) is inserted into the ampicillin
resistance marker gene.

Baculovirus vectors, such as a pBlueBac (also called pJVETL and
derivatives thereof) vector, particularly pBlueBac III (see, e.g., U.S. Patent Nos.
5,278,050, 5,244,805, 5,243,041, 5,242,687, 5,266,317, 4,745,051, and 5,169,784;
available from Invitrogen, San Diego), may also be used for expression of the
polypeptides in insect cells. The pBlueBac III vector is a dual promoter vector and
provides for the selection of recombinants by blue/white screening, as this plasmid
contains the β-galactosidase gene (lacZ) under the control of the insect recognizable
ETL promoter and is inducible with IPTG. A DNA construct may be made in
baculovirus vector pBlueBac III (Invitrogen, San Diego, CA) and then co-transfected
with wild type virus into insect Spodoptera frugiperda cells (Sf9 cells; see, e.g.,
Luckow et al., Bio/technology 6:47-55, 1988, and U.S. Patent No. 4,745,051).

Other plasmids include the pIN-IIIompA plasmids (see, U.S. Patent No.
4,575,013 to Inouye; see, also, Duffaud et al., Meth. Enz. 153:492-507, 1987), such as
pIN-IIIompA2. The pIN-IIIompA plasmids include an insertion site for heterologous
DNA linked in transcriptional reading frame with four functional fragments derived
from the lipoprotein gene of E. coli. The plasmids also include a DNA fragment coding
for the signal peptide of the ompA protein of E. coli, positioned such that the desired
polypeptide is expressed with the ompA signal peptide at its amino terminus, thereby
allowing efficient secretion across the cytoplasmic membrane. The plasmids further
include DNA encoding a specific segment of the E. coli lac promoter-operator, which is
positioned in the proper orientation for transcriptional expression of the desired
polypeptide, as well as a separate functional E. coli lacI gene encoding the associated
repressor molecule that, in the absence of lac operon inducer, interacts with the lac
promoter-operator to prevent transcription therefrom. Expression of the desired
polypeptide is under the control of the lipoprotein (lpp) promoter and the lac
promoter-operator, although transcription from either promoter is normally blocked by
the repressor molecule. The repressor is selectively inactivated by means of an inducer
molecule, thereby inducing transcriptional expression of the desired polypeptide from
both promoters.

A particularly preferred vector for expressing VEGF protein is pPL-λ
(Pharmacia Biotech, Uppsala, Sweden). This vector contains the tightly regulated
leftward promoter of bacteriophage λ, which is controlled by the cI repressor. The
promoter is temperature-inducible by using a bacterial host strain, such as N4830-1,
containing the temperature-sensitive cI857 repressor. The vector contains a unique
HpaI site for cloning. HpaI digestion leaves blunt ends. The VEGF or VEGF-
cytotoxic agent, such as VEGF-SAP, is prepared as a blunt-end fragment (see
Examples) and ligated into pPL-λ. Inclusion bodies containing the protein are isolated,
solubilized and refolded.

Preferably, the DNA fragment is replicated in bacterial cells, preferably
in E. coli. The preferred DNA fragment also includes a bacterial origin of replication,
to ensure the maintenance of the DNA fragment from generation to generation of the
bacteria. In this way, large quantities of the DNA fragment can be produced by
replication in bacteria. Preferred bacterial origins of replication include, but are not
limited to, the f1-ori and ColE1 origins of replication. Preferred hosts contain
chromosomal copies of DNA encoding T7 RNA polymerase operably linked to an
inducible promoter, such as the lacUV5 promoter (see, U.S. Patent No. 4,952,496).
Such hosts include, but are not limited to, lysogenic E. coli strains
HMS174(DE3)pLysS, BL21(DE3)pLysS, HMS174(DE3) and BL21(DE3). Strain
BL21(DE3) is preferred. The pLys strains provide low levels of T7 lysozyme, a natural
inhibitor of T7 RNA polymerase.

The DNA fragments provided may also contain a gene coding for a
repressor-protein. The repressor-protein is capable of repressing the transcription of a
promoter that contains sequences of nucleotides to which the repressor-protein binds.
The promoter can be derepressed by altering the physiological conditions of the cell.
The alteration can be accomplished by the addition to the growth medium of a molecule
that inhibits, for example, the ability of the repressor to interact with the operator or
with regulatory proteins or other regions of the DNA, or by altering the temperature of
the growth media. Preferred repressor-proteins include, but are not limited to, the
E. coli lacI repressor responsive to IPTG induction, the temperature sensitive cI857
repressor, and the like. The cI857 repressor is particularly preferred.

The DNA construct is introduced into a plasmid suitable for expression
in the selected host. The sequences of nucleotides in the plasmids that are regulatory
regions, such as promoters and operators, are operationally associated with one another
for transcription. The sequence of nucleotides encoding the VEGF, VEGF chimera or
cytotoxic agent may also include DNA encoding a secretion signal, whereby the
resulting peptide is a precursor protein. Secretion signals suitable for use are widely
available and are well known in the art. Prokaryotic and eukaryotic secretion signals
functional in E. coli may be employed. The presently preferred secretion signals
include, but are not limited to, those encoded by the following E. coli genes: ompA,
ompT, ompF, ompC, beta-lactamase, pelB and bacterial alkaline phosphatase, and the
like (von Heijne (1985) J. Mol. Biol. 184:99-105). In addition, the bacterial pelB gene
secretion signal (Lei et al. (1987) J. Bacteriol. 169:4379), the phoA secretion
signal, and the cek2 secretion signal, functional in insect cells, may be employed. The
most preferred secretion signal for bacterial expression is the E. coli ompA secretion
signal. For eukaryotic expression systems, particularly insect cell systems, the signals
from secreted proteins, such as insulin, growth hormone, melittin, and mammalian
alkaline phosphatase, are of interest herein. Other prokaryotic and eukaryotic secretion
signals known to those of skill in the art may also be employed (see, e.g., von Heijne
(1985) J. Mol. Biol. 184:99-105). Using the methods described herein, one of skill in
the art can substitute secretion signals that are functional in either yeast, insect or
mammalian cells to secrete the heterologous protein from those cells. The resulting
processed protein may be recovered from the periplasmic space or the fermentation
medium or growth medium.

In certain preferred embodiments, the constructs also include a
transcription terminator sequence. The promoter regions and transcription terminators
are each independently selected from the same or different genes. In some
embodiments, the DNA fragment is replicated in bacterial cells, preferably in E. coli.
The DNA fragment also typically includes a bacterial origin of replication, to ensure the
maintenance of the DNA fragment from generation to generation of the bacteria. In this
way, large quantities of the DNA fragment can be produced by replication in bacteria.
Preferred bacterial origins of replication include, but are not limited to, the f1-ori and
ColE1 origins of replication.

DNA encoding full-length VEGF, VEGF-SAP, SAP-VEGF with and
without linkers, and other such constructs, are introduced into the pET vectors,
preferably pET-11a (Novagen, Madison, WI), or the pPL-λ vector (Pharmacia). It is
found that, for expression in bacterial hosts, constructs in which DNA encoding SAP
is linked, directly or via a linker, to DNA encoding the N-terminus of VEGF are
preferred. When the SAP-VEGF121 or SAP-VEGF165 is produced in pPL-λ, no linker
is preferable. Also, constructs containing DNA encoding two monomers, which upon
expression dimerize, preferably in an antiparallel manner, are preferred.

E. Method of preparation of VEGF-targeted agent conjugates 

Conjugates that contain one or more VEGF polypeptides linked, either
directly or via a linker, to one or more targeted agents are provided. The presently
preferred VEGF monomers are VEGF165 and VEGF121. VEGF121 is particularly
preferred.

As described above, the conjugates contain the following components:
(VEGF)n, (L)q, and (targeted agent)m, in which: at least one VEGF moiety is linked
with or without a linker (L) to at least one targeted agent; n is 1 or more, generally is at
least 2, and typically is between 2 and 6; q is 0 or more as long as the resulting
conjugate binds to the targeted receptor, is internalized and delivers the targeted agent;
q is generally 0 or 1 to 4; m is 1 or more, generally 1 or 2; L refers to a linker; and the
targeted agent is any agent, such as a cytotoxic agent or a nucleic acid, or a drug, such
as methotrexate, intended for internalization by a cell that expresses a receptor to which
VEGF binds and upon binding is internalized. The components may be organized in
any order.
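
The numerical limits on n, q and m stated above can be collected in a short sketch. The class and method names below are illustrative assumptions and do not appear elsewhere herein; the functional requirement that the conjugate bind the receptor and be internalized is determined experimentally and is not modeled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConjugateComposition:
    """Component counts for a (VEGF)n, (L)q, (targeted agent)m conjugate.

    Hypothetical helper for illustration only.
    """

    n_vegf: int     # n: number of VEGF moieties
    q_linkers: int  # q: number of linkers
    m_agents: int   # m: number of targeted agents

    def __post_init__(self) -> None:
        # Hard limits stated in the text: n >= 1, q >= 0, m >= 1.
        if self.n_vegf < 1:
            raise ValueError("n must be 1 or more")
        if self.q_linkers < 0:
            raise ValueError("q must be 0 or more")
        if self.m_agents < 1:
            raise ValueError("m must be 1 or more")

    def in_typical_range(self) -> bool:
        # Typical (not required) values: n from 2 to 6, q from 0 to 4, m 1 or 2.
        return (
            2 <= self.n_vegf <= 6
            and 0 <= self.q_linkers <= 4
            and 1 <= self.m_agents <= 2
        )
```

For example, a VEGF dimer joined through one linker to one targeted agent would be written `ConjugateComposition(2, 1, 1)` and falls within the typical ranges.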

It is also understood that substitutions in codons by virtue of the
degeneracy of the genetic code are encompassed by DNA encoding such VEGF. DNA
encoding the VEGF polypeptide may be obtained from any source known to those of
skill in the art; it may be isolated using standard cloning methods, synthesized,
obtained from commercial sources, or prepared as described in any of the patents and
publications noted herein.

In some embodiments, the conjugates provided herein may be
represented by the formula (I):

(VEGFp-(L)q-targeted agent)n

in which VEGF refers to a polypeptide that is reactive with a VEGF
receptor (also referred to herein as a VEGF protein), such as VEGF; L refers to a
linker, which may be present or absent; q is 0 or more as long as the resulting conjugate
binds to a targeted receptor and the targeted agent is internalized; p is 1 or more,
preferably 1, and generally less than or equal to 4; the targeted agent is any agent,
such as a cytotoxic agent or a nucleic acid, or a drug, such as methotrexate, intended for
internalization by a cell that expresses a VEGF receptor; n is 1 or 2; and the VEGF may
be linked to the linker or targeted agent via its N-terminus or C-terminus or any other
locus in the polypeptide, such as derivatized Cys residues. When n is 2, the conjugates are
linked via cysteine residues on the VEGF, probably via residues that correspond to the
cysteines at positions 77 and 86 in SEQ ID NOs. 25-28. The linked VEGFs may be
linked in a parallel fashion or antiparallel fashion. Conjugates of the formula (II):
targeted agent-L-(VEGF)n, in which n is 1 or 2, are also provided. These conjugates
are prepared by mixing conjugates of formula I with unconjugated VEGF, by preparing
fusion proteins from DNA constructs that encode two VEGF moieties, or by mixing
conjugates of formulas I and II. The VEGF moieties are preferably linked via a linker
to facilitate dimerization.

It is understood that the VEGF and targeted agent (or linker and targeted 
agent) may be linked in any order and through any appropriate linkage, as long as the 
resulting conjugate binds to a receptor to which VEGF binds and internalizes the 
targeted agent(s) in cells bearing the receptor. 

For example, the VEGF polypeptide may be linked to the targeted agent
or linker at or near its N-terminus or at or near its C-terminus. The VEGF may be
linked to a second VEGF monomer, which may be the same monomer or a different
monomer; and one or more targeted agents that are the same or different may be linked
to the VEGF or may be linked to each other. The linkage may be at any locus, although
the N-terminus region of VEGF (within about 20, preferably 10, amino acids from the
N-terminus) is preferred. When multiple VEGFs are linked, they may be in a head to
head, head to tail or tail to tail orientation. If more than one targeted agent is included,
the second may be the same or different from the first agent. In order to efficiently bind
to VEGF receptors and deliver a targeted agent to a cell, VEGF dimerization appears to
be required.

In addition, conjugates in which non-essential cysteines in the VEGF
monomers and/or targeted agent, if the agent is a polypeptide, are deleted or replaced
with Ser or other conservative substitution are provided. Compositions of such
conjugates should exhibit reduced aggregation compared to conjugates that contain
non-essential cysteines. Non-essential cysteines may be identified empirically.
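
As a minimal illustration of replacing selected cysteines with serine at the sequence level, the following sketch operates on a one-letter amino acid string. The function name and the 12-residue example peptide are hypothetical and are not taken from any sequence listing; which cysteines are non-essential must still be determined empirically.

```python
from typing import Iterable


def replace_cysteines(protein: str, positions: Iterable[int], substitute: str = "S") -> str:
    """Return a copy of `protein` with Cys at the given 1-based positions replaced.

    Raises ValueError if a position is out of range or is not a cysteine,
    so that a numbering mistake is caught rather than silently applied.
    """
    residues = list(protein)
    for pos in positions:
        if not 1 <= pos <= len(residues):
            raise ValueError(f"position {pos} is outside the sequence")
        if residues[pos - 1] != "C":
            raise ValueError(f"residue {pos} is {residues[pos - 1]}, not Cys")
        residues[pos - 1] = substitute
    return "".join(residues)


# Hypothetical peptide with cysteines at positions 3, 7 and 12.
example_peptide = "MACDEFCGHIKC"
mutein = replace_cysteines(example_peptide, [7, 12])
```

Here the cysteine at position 3 is retained, as would be done for a residue required for folding or reserved as the conjugation site.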

Polypeptides that are reactive with a VEGF receptor (VEGF proteins)
include any molecule that reacts with VEGF receptors on cells that bear VEGF
receptors and results in internalization of the linked cytotoxic agent. Particularly
preferred polypeptides that are reactive with a VEGF receptor include members of the
VEGF family of polypeptides, muteins of these polypeptides, chimeric or hybrid
molecules that contain portions of any of these family members, and any portion
thereof that binds to VEGF receptors and internalizes a linked agent.

The linker is selected to increase the specificity, toxicity, solubility,
serum stability, and/or intracellular availability of the targeted moiety. More preferred
linkers are those that can be incorporated in fusion proteins and expressed in a host cell,
such as E. coli. Such linkers include: enzyme substrates, such as cathepsin B substrate,
cathepsin D substrate, trypsin substrate, thrombin substrate, subtilisin substrate, factor
Xa substrate, and enterokinase substrate; linkers that increase solubility, flexibility,
and/or intracellular cleavability, such as (GlymSer)n and (SermGly)n, in which n is 1 to
6, preferably 1 to 4, most preferably 1, and m is 1 to 6, preferably 1 to 4, more 
preferably 12 to 4, most preferably. Preferred among such linkers, are those, such as 
cathepsin D substrate, that are preferentially cleaved in the endosome or cytoplasm
following internalization of the conjugate linker; other such linkers, such as (GlymSer)n
and (SermGly)n, also increase the flexibility, serum stability and/or solubility of the
conjugate or the availability of the region joining the VEGF and targeted agent for
cleavage. In some embodiments, several linkers that are the same or different may be
included in order to take advantage of desired properties of each linker.
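
The repeat linkers (GlymSer)n and (SermGly)n can be written out in one-letter code as in the sketch below. The function names are illustrative assumptions; the bounds of 1 to 6 for both m and n follow the broadest ranges stated above.

```python
def _check_repeat_counts(m: int, n: int) -> None:
    # Broadest ranges given in the text: m and n are each 1 to 6.
    if not 1 <= m <= 6:
        raise ValueError("m must be from 1 to 6")
    if not 1 <= n <= 6:
        raise ValueError("n must be from 1 to 6")


def gly_ser_linker(m: int, n: int) -> str:
    """Amino acid sequence of (Gly_m Ser)_n in one-letter code."""
    _check_repeat_counts(m, n)
    return ("G" * m + "S") * n


def ser_gly_linker(m: int, n: int) -> str:
    """Amino acid sequence of (Ser_m Gly)_n in one-letter code."""
    _check_repeat_counts(m, n)
    return ("S" * m + "G") * n
```

For instance, `gly_ser_linker(4, 1)` gives `GGGGS` and `ser_gly_linker(4, 2)` gives `SSSSGSSSSG`.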

Other linkers are suitable for incorporation into chemically produced
conjugates. Linkers that are suitable for chemically linking conjugates include
disulfide bonds, thioether bonds, hindered disulfide bonds, and covalent bonds between
free reactive groups, such as amine and thiol groups. These bonds are produced using
heterobifunctional reagents to produce reactive thiol groups on one or both of the
polypeptides and then reacting the thiol groups on one polypeptide with reactive thiol
groups or amine groups to which reactive maleimido groups or thiol groups can be
attached on the other. Other linkers include acid cleavable linkers, such as
bismaleimidoethoxy propane, acid labile-transferrin conjugates and adipic acid
dihydrazide, that would be cleaved in more acidic intracellular compartments and cross
linkers that are cleaved upon exposure to UV or visible light and linkers.

The targeted agents or moieties include any molecule that, when
internalized, alters the metabolism or gene expression in the cell. Such agents include
cytotoxic agents, such as ribosome inactivating proteins, DNA encoding cytotoxic
agents, and antisense nucleic acids, that result in inhibition of growth or cell death.
Other such agents also include antisense RNA, DNA, and truncated proteins that alter
gene expression via interactions with the DNA, or co-suppression or other mechanism.
The conjugates herein may also be used to deliver DNA and thereby serve as agents for
gene therapy or to deliver agents that, upon transcription and/or translation thereof,
result in cell death. Cytotoxic agents include, but are not limited to, ribosome
inactivating proteins, inhibitors of DNA, RNA and/or protein synthesis, including
antisense nucleic acids, and other metabolic inhibitors. In certain embodiments, the
cytotoxic agent is a ribosome-inactivating protein (RIP), such as, for example, saporin,
although other cytotoxic agents can also be advantageously used.

The targeted agents may also be modified to render them more suitable
for conjugation with the linker and/or a VEGF protein or to increase their intracellular
activity. Such modifications include, but are not limited to, the introduction of a Cys
residue at or near the N-terminus or C-terminus, derivatization to introduce reactive
groups, such as thiol groups, and addition of sorting signals, such as (XaaAspGluLeu)n
(SEQ ID NO. 50), where Xaa is Lys or Arg, preferably Lys, and n is 1 to 6, preferably
1-3, at, preferably, the carboxy-terminus (see, e.g., Seetharam et al. (1991) J. Biol.
Chem. 266:17376-17381; and Buchner et al. (1992) Anal. Biochem. 205:263-270), that
direct the targeted agent to the endoplasmic reticulum or the addition of a cytoplasmic
sorting sequence, such as KDEL (see discussion herein).

Conjugates that contain a plurality of monomers of a VEGF protein
linked to the cytotoxic agent are also provided. These conjugates, which contain several,
typically two to about six, monomers can be produced by linking multiple copies of
DNA encoding the VEGF fusion protein, typically head-to-tail, under the
transcriptional control of a single promoter region. In addition, conjugates that contain
more than one targeted agent per VEGF, such as SAP-VEGF-SAP, linked with or
without linkers are contemplated herein.


1. Chemical conjugation 

a. Preparation of VEGF polypeptides for chemical conjugation 

VEGF may be isolated from a suitable source or may be produced using
recombinant DNA methodology, discussed below. As used herein, substantially pure
means sufficiently homogeneous to appear free of readily detectable impurities as
determined by standard methods of analysis, such as thin layer chromatography (TLC),
gel electrophoresis, high performance liquid chromatography (HPLC), used by those of
skill in the art to assess such purity, or sufficiently pure such that further purification
would not detectably alter the physical and chemical properties, such as enzymatic and
biological activities, of the substance. Methods for purification of the compounds to
produce substantially chemically pure compounds are known to those of skill in the art.
A substantially chemically pure compound may, however, be a mixture of
stereoisomers. In such instances, further purification might increase the specific
activity of the compound.

To effect chemical conjugation herein, the VEGF protein is conjugated
generally via a reactive amine group or thiol group to the targeted agent or to a linker,
which has been or is subsequently linked to the targeted agent. The VEGF protein is
conjugated either via its N-terminus, C-terminus, or elsewhere in the polypeptide. In
preferred embodiments, the VEGF protein is conjugated via a reactive cysteine residue
to the linker or to the targeted agent. The VEGF can also be modified by addition of a
cysteine residue, either by replacing a residue or by inserting the cysteine, at or near the
amino or carboxyl terminus, within about 20, preferably 10 residues from either end,
and preferably at or near the amino terminus.

In preferred embodiments, to reduce the heterogeneity of preparations,
the VEGF protein is modified by mutagenesis to replace reactive cysteines, leaving,
preferably, only one available cysteine for reaction. The VEGF protein is modified by
deleting or replacing a site(s) on the VEGF that causes the heterogeneity. Such sites are
typically cysteine residues that, upon folding of the protein, remain available for
interaction with other cysteines or for interaction with more than one cytotoxic
molecule per molecule of VEGF peptide. Thus, such cysteine residues do not include
any cysteine residues that are required for proper folding of the VEGF peptide or for
retention of the ability to bind to a VEGF receptor and internalize. For chemical
conjugation, one cysteine residue that, in physiological conditions, is available for
interaction, is not replaced because it is used as the site for linking the cytotoxic moiety.
The resulting modified VEGF is conjugated with a single species of cytotoxic
conjugate.

Alternatively, the contribution of each cysteine to the ability to bind to
VEGF receptors may be determined empirically, as described above. Recently the role
of Cys-25, Cys-56, Cys-67, Cys-101, and Cys-145 in dimerization and biological
activity was assessed (Claffey, supra). Cys-25, Cys-56, and Cys-67 are required for
dimerization; Cys-101 is required for expression. Substitution of Cys-145 is preferred.


b. Preparation of targeted proteins for chemical conjugation 

If the targeted agent is a polypeptide, it may be directly linked to the
VEGF or VEGF with linker or to a linker by reaction of a reactive group in the
polypeptide. It is desirable, however, that the agent react at only a single locus, so
that the resulting preparation of conjugates is homogeneous. Thus, if necessary, the
targeted agent can be derivatized and then a single species isolated. Alternatively, and
preferably for chemical conjugation, saporin can be modified so that it only has one
reactive group, such as a cysteine, for a particular set of conditions and reagents. For
example, saporin has been derivatized and a single species isolated and has also been
modified by introduction of a single cysteine residue.

Saporin for chemical conjugation may be produced by isolating the
protein from the leaves or seeds of Saponaria officinalis or using recombinant methods
and the DNA provided herein or known to those of skill in the art or obtained by
screening appropriate libraries (see, e.g., International PCT Application WO 93/25688,
which describes the isolation of saporin, plasmids containing DNA encoding saporin,
expression of saporin and isolation of purified saporin). Some DNA encoding saporin
may also include an N-terminal extension sequence linked to the amino terminus of the
saporin that encodes a linker so that, if desired, the SAP and linker can be expressed as
a fusion protein as described herein. The sequence of DNA encoding saporin is set
forth in SEQ ID Nos. 3-7.

The DNA molecules provided herein encode saporin that has
substantially the same amino acid sequence and ribosome-inactivating activity as that
of saporin-6 (SO-6), including any of four isoforms, which have heterogeneity at amino
acid positions 48 and 91 (see, e.g., Maras et al. (1990) Biochem. Internat. 21:631-638
and Barra et al. (1991) Biotechnol. Appl. Biochem. 13:48-53 and SEQ ID NOs. 3-7).
Other suitable saporin polypeptides include other members of the multi-gene family
coding for isoforms of saporin-type RIPs including SO-1 and SO-3 (Fordham-Skelton
et al. (1990) Mol. Gen. Genet. 221:134-138), SO-2 (see, e.g., U.S. Application Serial
No. 07/885,242, which corresponds to GB 2,216,891; see, also, Fordham-Skelton et al.
(1991) Mol. Gen. Genet. 229:460-466), SO-4 (see, e.g., GB 2,194,241 B; see, also,
Lappi et al. (1985) Biochem. Biophys. Res. Commun. 129:934-942) and SO-5 (see, e.g.,
GB 2,194,241 B; see, also, Montecucchi et al. (1989) Int. J. Peptide Protein Res.
33:263-267; and Ferreras et al. (1993) Biochim. Biophys. Acta 1216:31-42). SO-4,
which includes the N-terminal 40 amino acids set forth in SEQ ID NO. 77, is isolated
from the leaves of Saponaria officinalis by extraction with 0.1 M phosphate buffer at
pH 7, followed by dialysis of the supernatant against sodium borate buffer, pH 9, and
selective elution from a negatively charged ion exchange resin, such as Mono S
(Pharmacia Fine Chemicals, Sweden) using a gradient of 1 to 0.3 M NaCl and is the
first eluting chromatographic fraction that has SAP activity. The second eluting
fraction is SO-5.

Because more than one amino group on SAP may react with the
succinimidyl moiety, it is possible that more than one amino group on the surface of the
protein is reactive. This creates the potential for heterogeneity even if mono-
derivatized SAP is used. This source of heterogeneity has been solved by
conjugating modified SAP expressed in E. coli that has an additional cysteine inserted
in the coding sequence, preferably within 10 or 20 amino acids of either the C-terminus
or N-terminus. The preferred molecule has the Met-Cys inserted at the N-terminus.

As discussed above, muteins of saporin that contain a Cys at or near the
amino or carboxyl terminus can be prepared. Thus, instead of derivatizing saporin to
introduce a sulfhydryl, the saporin can be modified by the introduction of a cysteine
residue into the SAP such that the resulting modified saporin protein reacts with a
VEGF monomer or a linker (and then to a VEGF monomer) to produce a conjugate. It
is understood that, as discussed above and below, in order for the cytotoxic conjugates
herein to bind to VEGF receptors most effectively, the VEGF portion of the conjugate
should be dimerized.

Preferred loci for introduction of a cysteine residue include the N-
terminus region, preferably within about one to twenty residues, more preferably one to
about ten residues, from the N-terminus of the cytotoxic agent, such as SAP. For
expression of SAP in the bacterial host systems herein, it is also desirable to add DNA
encoding a methionine linked to the DNA encoding the N-terminus of the saporin
protein. DNA encoding SAP has been modified by inserting a DNA encoding Met-Cys
(ATG TGT or ATG TGC) at the N-terminus immediately adjacent to the codon for the
first residue of the mature protein.
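
The Met-Cys insertion described above amounts to prepending six nucleotides to the coding sequence of the mature protein. The sketch below shows this at the string level; the function name and the nine-nucleotide example fragment are hypothetical and are not taken from SEQ ID NOs. 3-7.

```python
def add_met_cys(mature_cds: str, cys_codon: str = "TGT") -> str:
    """Prepend codons for Met-Cys (ATG TGT or ATG TGC) to a coding sequence.

    `mature_cds` is assumed to begin with the codon for the first residue
    of the mature protein and to contain whole codons only.
    """
    if cys_codon not in ("TGT", "TGC"):
        raise ValueError("cysteine is encoded by TGT or TGC")
    cds = mature_cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence must contain whole codons")
    # The insertion is in frame, so the downstream reading frame is unchanged.
    return "ATG" + cys_codon + cds


# Hypothetical three-codon fragment standing in for the start of a mature protein.
modified = add_met_cys("gtcacatca")
```

Because six nucleotides (two codons) are added, the length of the modified sequence remains a multiple of three.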

Muteins in which a cysteine residue has been added at the N-terminus
and muteins in which the amino acid at position 4 or 10 has been replaced with cysteine
have been prepared by modifying the DNA encoding saporin (see, Examples).
Preferably, saporin has a cysteine added at the -1 position (see Example 3). The
modified DNA may be expressed and the resulting saporin protein purified, as
described herein for expression and purification of the resulting SAP. The modified
saporin can then be reacted with a VEGF, preferably a VEGF dimer, to form disulfide
linkages between the VEGF dimer and the cysteine residue on the modified SAP.

Typically, SAP is derivatized by reaction with SPDP. This results in a
heterogeneous population. For example, SAP that is derivatized by SPDP to a level of
0.9 moles pyridine-disulfide per mole of SAP includes a population of non-derivatized,
mono-derivatized and di-derivatized SAP. Methods for isolation of mono-derivatized
saporin are described, for example, in Lappi et al. (1993) Anal. Biochem. 212:446-451
and copending U.S. Application Serial No. 08/099,924. The methods rely on the charge
differences among the three species of SAP that are produced upon reaction of one or
more lysines in saporin with SPDP. The mono-derivatized saporin species is purified
by Mono-S cation exchange chromatography and pooling of the fractions that contain
the mono-derivatized species. Briefly, the initial eluting peak is composed of SAP that
is approximately di-derivatized; the second peak is mono-derivatized and the third peak
shows no derivatization. The di-derivatized material accounts for 20% of the three
peaks; the second accounts for 48% and the third peak contains 32%. Fractions that
have a ratio of SPDP to SAP greater than 0.85 but less than 1.05 are pooled, dialyzed
against an appropriate buffer, such as 0.1 M sodium chloride, 0.1 M sodium phosphate,
pH 7.5, and used for coupling to a linker, to a VEGF monomer, a VEGF dimer, a VEGF
monomer with linker, or a VEGF dimer with linker.
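
The peak composition and the pooling criterion above can be checked with a short calculation. In the sketch below the function names and the per-fraction ratios are hypothetical illustrations; only the 20%/48%/32% peak composition and the 0.85 to 1.05 window come from the text.

```python
def mean_derivatization(peak_fractions: dict) -> float:
    """Average moles of SPDP per mole of SAP.

    `peak_fractions` maps the number of SPDP groups per SAP molecule
    to the fraction of total protein found in that peak.
    """
    total = sum(peak_fractions.values())
    return sum(groups * frac for groups, frac in peak_fractions.items()) / total


def pool_fractions(ratios: dict, low: float = 0.85, high: float = 1.05) -> list:
    """Names of fractions whose SPDP:SAP ratio is greater than `low` and less than `high`."""
    return [name for name, ratio in ratios.items() if low < ratio < high]


# Peak composition stated in the text: di- 20%, mono- 48%, non-derivatized 32%.
peaks = {2: 0.20, 1: 0.48, 0: 0.32}
average = mean_derivatization(peaks)  # 2(0.20) + 1(0.48) + 0(0.32) = 0.88

# Hypothetical per-fraction ratios, for illustration only.
measured = {"F10": 1.60, "F11": 1.04, "F12": 0.95, "F13": 0.85, "F14": 0.40}
pooled = pool_fractions(measured)
```

The computed average of 0.88 is consistent with the stated derivatization level of about 0.9 moles per mole. With the hypothetical ratios, only F11 and F12 are pooled, since a ratio of exactly 0.85 falls outside the window.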

The resulting preparation, although more uniform, still contains some
heterogeneity because native saporin as purified from the seed is a mixture of four
isoforms, as judged by protein sequencing (see, e.g., PCT Application WO 93/25688
(Serial No. PCT/US93/05702), United States Application Serial No. 07/901,718; see
also, Montecucchi et al. (1989) Int. J. Pept. Prot. Res. 33:263-267; Maras et al. (1990)
Biochem. Internat. 21:631-638; and Barra et al. (1991) Biotechnol. Appl. Biochem.
13:48-53). This creates some heterogeneity in the conjugates, since the reaction with
SPDP probably occurs equally within each isoform. This source of heterogeneity can
be removed by using saporin expressed in E. coli.

c. Chemical conjugation of a VEGF protein to linkers and
targeted agents

The VEGF monomers are preferably linked via non-essential cysteine
residues to the linkers or to the targeted agent. VEGF that has been modified by
introduction of a Cys residue at or near one terminus, preferably the N-terminus, is
preferred for use in chemical conjugation (see Examples for preparation of such
modified VEGF). For use herein, the VEGF, preferably, is dimerized prior to linkage
to the linker and/or targeted agent. Methods for coupling proteins to the linkers, such
as the heterobifunctional agents, or to nucleic acids, or to proteins are known to those of
skill in the art and are also described herein.

Methods for chemical conjugation of proteins are known to those of skill
in the art. The preferred methods for chemical conjugation depend on the selected
components, but preferably rely on disulfide bond formation. For example, if the
targeted agent is SPDP-derivatized saporin, then it is advantageous to dimerize the
VEGF moiety prior to coupling or conjugating to the derivatized saporin.

2. Fusion protein of a VEGF polypeptide and targeted agent 

Expression of DNA encoding a fusion of a VEGF protein linked to the
targeted agent results in a more homogeneous preparation of cytotoxic conjugates and
is suitable for use when the selected targeting agent and linker are polypeptides.
Aggregate formation may be reduced in preparations containing the fusion proteins by
modifying the VEGF, particularly VEGF165, VEGF189 and VEGF206, which contain
nonessential cysteines in the heparin binding domain, and/or the targeted agent to
prevent cysteine-cysteine interactions between each conjugate or decrease secondary
structure.

a. Expression of VEGF 

DNA encoding VEGF peptides and/or the amino acid sequences of
VEGFs are known to those of skill in this art (see, e.g., SEQ ID NOs. 18-28). DNA
may be prepared synthetically based on the amino acid sequence or known DNA
sequence of a VEGF or may be isolated using methods known to those of skill in the art
or obtained from commercial or other sources known to those of skill in this art.

It is also understood that substitutions in codons by virtue of the
degeneracy of the genetic code are encompassed by DNA encoding such VEGF. DNA
encoding the VEGF polypeptide may be obtained from any source known to those of
skill in the art; it may be isolated using standard cloning methods, synthesized or
obtained from commercial sources, or prepared as described in any of the patents and
publications noted herein.
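
The point about codon degeneracy can be illustrated with a short sketch. It uses only a small subset of the standard genetic code and a hypothetical seven-residue peptide (Ala-Met followed by Gly4Ser, chosen because these linker motifs appear in the constructs herein); neither DNA sequence below is taken from the sequence listing.

```python
from math import prod

# Subset of the standard genetic code (DNA codons), enough for this example.
CODONS = {
    "A": ["GCT", "GCC", "GCA", "GCG"],
    "M": ["ATG"],
    "G": ["GGT", "GGC", "GGA", "GGG"],
    "S": ["TCT", "TCC", "TCA", "TCG", "AGT", "AGC"],
    "C": ["TGT", "TGC"],
}
CODON_TO_AA = {codon: aa for aa, codons in CODONS.items() for codon in codons}


def translate(dna):
    """Translate an in-frame DNA string using the codon subset above."""
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


def synonymous_count(peptide):
    """Number of distinct DNA sequences that encode the same peptide."""
    return prod(len(CODONS[aa]) for aa in peptide)


# Two hypothetical DNA sequences that differ at several codons.
seq_a = "GCTATGGGTGGTGGTGGTTCT"
seq_b = "GCCATGGGCGGAGGGGGTAGC"

print(translate(seq_a), translate(seq_b))
print(synonymous_count("AMGGGGS"))
```

Both sequences translate to the same peptide, so either would fall within "DNA encoding" that peptide in the sense used above.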

Such DNA may then be mutagenized using standard methodologies to
delete or replace any cysteine residues, as described herein, that are not required for
dimerization and receptor binding and internalization, or to insert cysteine residues for
chemical conjugation (see SEQ ID NOs. 86-89). As necessary, the identity of
non-essential cysteine residues may be determined empirically, by deleting, inserting
and/or replacing a cysteine residue and ascertaining whether the resulting VEGF with the
deleted cysteine forms aggregates in solutions containing physiologically acceptable
buffers and salts. Loci for insertion of cysteine residues may also be determined
empirically. Generally, such loci lie in regions at or near (within 20, preferably 10
amino acids) the C- or, preferably, the N-terminus.

As discussed above, binding to a VEGF receptor followed by
internalization are the activities required for a VEGF protein to be suitable for use herein. A
test of such activity, which reflects the ability to bind to VEGF receptors and to be
internalized, is the ability of a conjugate containing VEGF (e.g., VEGF-saporin) to
inhibit proliferation of cells, such as vascular endothelial cells, including bovine or
human aortic endothelial cells, that bear VEGF receptors. Any VEGF polypeptide that
possesses such ability is intended for use herein.

The DNA encoding the conjugate can be inserted into a plasmid and
expressed in a selected host. Multiple copies of the DNA encoding the VEGF-targeted
agent chimera or VEGF-cytotoxic agent chimera can be inserted into a single plasmid
in operative linkage with one promoter. When expressed, the resulting protein will be a
VEGF-cytotoxic agent multimer. Typically two to six copies of the chimera are
inserted into a plasmid, preferably in a head to tail orientation. Alternatively, one or
more copies of the VEGF-targeted agent chimera is inserted under the control of a first
promoter in a plasmid and one or more copies encoding a VEGF polypeptide is inserted
under the control of a second promoter in the plasmid or into a second plasmid. The
resulting plasmid(s) is (are) introduced into a host and cultured under conditions in
which the promoter is active and the conjugate and a VEGF polypeptide are produced.
The resulting preparation is treated to permit refolding of the VEGF and dimerization.
Conjugates containing VEGF dimers are isolated.

b. Preparation of muteins for recombinant production of the 
conjugates 

For recombinant expression using the methods herein, all cysteines in
the VEGF peptide that are not required for biological activity can be deleted or
replaced; and for use in the chemical conjugation methods herein, all except for one of
these cysteines, which will be used for chemical conjugation to the cytotoxic agent, can
be deleted or replaced. Human (and the corresponding bovine protein) VEGF121 has
nine cysteine residues and VEGF165 and VEGF189 have 16 cysteine residues per
monomer. Each of the nine cysteines may be replaced and the resulting mutein tested
for the ability to bind to VEGF receptors and to be internalized as described herein.
Alternatively, the resulting mutein-encoding DNA is used as part of a construct
containing DNA encoding the cytotoxic agent linked to the VEGF-encoding DNA. The
construct is expressed in a suitable host cell and the resulting protein tested for the
ability to bind to VEGF receptors and internalize the cytotoxic agent. As long as this
ability is retained, the mutein is suitable for use herein.
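
The one-at-a-time replacement strategy described above can be sketched as follows. This is an illustration only: the nine-residue sequence is a hypothetical toy string, not a VEGF sequence, and the choice of serine as the replacement residue is an assumption made for the example.

```python
def cysteine_positions(protein):
    """Return 1-based positions of every cysteine (C) in a one-letter sequence."""
    return [index + 1 for index, residue in enumerate(protein) if residue == "C"]


def single_replacement_muteins(protein, replacement="S"):
    """Map each cysteine position to the mutein with only that cysteine replaced.

    The default replacement residue (serine) is an assumption for illustration.
    """
    muteins = {}
    for position in cysteine_positions(protein):
        muteins[position] = protein[:position - 1] + replacement + protein[position:]
    return muteins


toy_sequence = "MACGHCKLC"  # hypothetical, not taken from the sequence listing
for position, mutein in single_replacement_muteins(toy_sequence).items():
    print(position, mutein)
```

Each mutein produced this way would then be tested for receptor binding and internalization, as described above, before being accepted for use.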

c. DNA constructs and expression of the constructs 

DNA encoding VEGF conjugates is expressed in any suitable host,
particularly bacterial and insect hosts. Methods and plasmids for such expression are
set forth in the examples (see, also, Table 3).

Using the methods and materials described above and in the examples,
numerous chemical conjugates and fusion proteins have been synthesized. These
include the constructs set forth in Table 3, below.

Particular details of the syntheses of the constructs are set forth in the
EXAMPLES. The constructs have been synthesized and have been or can be inserted
into plasmids including pET 11 (with and without the T7 transcription terminator), pET
12 and pET 15 (Invitrogen, San Diego), pPL-λ and pKK223-3 (Pharmacia, P.L.) and
derivatives of pKK223-3. The resulting plasmids have been and can be transformed
into bacterial hosts including BL21, BL21(DE3), HMS174(DE3) (Novagen, Madison,
WI) and N4830(cI857) (see Gottesman et al. (1980) J. Mol. Biol. 140:57-75;
commercially available from PL Biochemicals, Inc.; also see, e.g., U.S. Patent Nos.
5,266,465, 5,260,223, 5,256,769, 5,252,725, 5,250,296, 5,244,797,
5,236,828, 5,234,829, 5,229,273, 4,798,886, 4,849,350, 4,820,631 and 4,780,313) or
N99cI+ for pPL-λ. N4830 harbors a heavily deleted phage lambda prophage carrying
the mutant cI857 temperature sensitive repressor and an active N gene. The constructs
have also been introduced in a baculovirus vector sold commercially under the name
pBlueBacIII (Invitrogen, San Diego, CA; see the Invitrogen catalog; see, also, Vialard
et al. (1990) J. Virol. 64:37; U.S. Patent No. 5,270,458; U.S. Patent No. 5,243,041; and
published International PCT Application WO 93/10139, which is based on U.S. patent
application Serial No. 07/792,600). The pBlueBacIII vector is a dual promoter vector
and provides for the selection of recombinants by blue/white screening as this plasmid
contains the β-galactosidase gene (lacZ) under the control of the insect recognizable
ETL promoter and is inducible with IPTG. The baculovirus vector is then
co-transfected with wild type virus into insect host cells Spodoptera frugiperda (Sf9; see,
e.g., Luckow et al. (1988) Bio/technology 6:47-55 and U.S. Patent No. 4,745,051).

TABLE 3

Plasmid(s)   Description of Fusion Protein                 Fusion Protein Name

PZ50B1       SAP CYS-1                                     FPS1
PZ51B1       SAP CYS+4                                     FPS2
PZ51E1       SAP CYS+4                                     FPS2
PZ52B1       SAP CYS+10                                    FPS3
PZ52E1       SAP CYS+10                                    FPS3
PZ70B1       VEGF165 (-signal seq.)                        FPV1
PZ71B1       VEGF165 (+signal seq.)                        FPV1
PZ115B1      VEGF121 (+signal seq.)
PZ116B1      VEGF121 (-signal seq.)
PZ72B1       VEGF165-AlaMet-SAP (-signal seq.)             FPVS1
PZ73B1       VEGF165-AlaMet-SAP (+signal seq.)             FPVS1
PZ74B1       SAP-AlaMet-VEGF165                            FPSV1
PZ74F5       SAP-AlaMet-VEGF165                            FPSV1
PZ75B1       SAP-(Gly4Ser)4-VEGF165                        FPSV2
PZ75F5       SAP-(Gly4Ser)4-VEGF165                        FPSV2
PZ76B1       SAP-AlaMet-VEGF121                            FPSV3
PZ76F5       SAP-AlaMet-VEGF121                            FPSV3
PZ77B1       SAP-G4Sx4-VEGF121                             FPSV4
PZ78F5       SAP-G4Sx4-VEGF121                             FPSV4
PZ79B1       SAP-AlaMet-VEGF121(Gly4Ser)-VEGF121           FPSVV1
PZ79F5       SAP-AlaMet-VEGF121(Gly4Ser)-VEGF121
PZ80B1       SAP-AlaMet-VEGF121(Gly4Ser)2-VEGF121          FPSVV2
PZ81B1       SAP-AlaMet-VEGF165(Gly4Ser)-VEGF165           FPSVV3
PZ81F5       SAP-AlaMet-VEGF165(Gly4Ser)-VEGF165
PZ82B1       SAP-AlaMet-VEGF165(Gly4Ser)2-VEGF165          FPSVV4
PZ83B1       SAP-(Gly4Ser)-VEGF121(Gly4Ser)-VEGF121        FPSVV5
PZ84B1       SAP-(Gly4Ser)2-VEGF121(Gly4Ser)2-VEGF121      FPSVV6
PZ85B1       SAP-(Gly4Ser)-VEGF165(Gly4Ser)-VEGF165        FPSVV7
PZ85F5       SAP-(Gly4Ser)-VEGF165(Gly4Ser)-VEGF165
PZ86B1       SAP-(Gly4Ser)2-VEGF165(Gly4Ser)2-VEGF165      FPSVV8
PZ87I        VEGF121 (Baculovirus) Viral Stock             FPV2
PZ87I7       VEGF121                                       FPV2
PZ88I        VEGF165 (Baculovirus) Viral Stock             FPV1
PZ88I7       VEGF165                                       FPV1
PZ89I        VEGF121CYS+2 (Baculovirus) Viral Stock        FPV3
PZ89I7       VEGF121CYS+2                                  FPV3
PZ90I        VEGF121CYS+4 (Baculovirus) Viral Stock        FPV4
PZ90I7       VEGF121CYS+4                                  FPV4
PZ91I        VEGF165CYS+2 (Baculovirus) Viral Stock        FPV5
PZ91I7       VEGF165CYS+2                                  FPV5
PZ92I        VEGF165CYS+4 (Baculovirus) Viral Stock        FPV6
PZ92I7       VEGF165CYS+4                                  FPV6
PZ93F5       MetVEGF121                                    FPV2
PZ94F5       MetVEGF165                                    FPV1
PZ95B1       pelB-SAP-AlaMet-V121-(G4S)-V121               FPSVV1
PZ96B1       ompA-SAP-AlaMet-V121-(G4S)-V121               FPSVV1
PZ97B1       ompT-SAP-AlaMet-V121-(G4S)-V121               FPSVV1
PZ98B1       phoA-SAP-AlaMet-V121-(G4S)-V121               FPSVV1
PZ99B1       pelB-SAP-AlaMet-V165-(G4S)-V165               FPSVV3
PZ100B1      ompA-SAP-AlaMet-V165-(G4S)-V165               FPSVV3
PZ101B1      ompT-SAP-AlaMet-V165-(G4S)-V165               FPSVV3
PZ102B1      phoA-SAP-AlaMet-V165-(G4S)-V165               FPSVV3
PZ103B1      SAP-VEGF exon 3,4,5                           FPSV5
PZ104B1      SAP-VEGF exon 6,7,8                           FPSV7
PZ105B1      SAP-VEGF exon 3,4,5,6
PZ106I1      pMAL-p2=I SAP-VEGF exon 3,4,5                 FPSV5
PZ107I1      pMAL-p2=I SAP-VEGF exon 3,4,5,6               FPSV6
PZ108I1      pMAL-p2=I SAP-VEGF exon 6,7,8                 FPSV7
PZ111J1      pGEX-5X=SAP VEGF exon 3,4,5                   FPSV5
PZ112J1      pGEX-5X=SAP VEGF exon 3,4,5,6                 FPSV6
PZ113J1      pGEX-5X=SAP VEGF exon 7,8                     FPSV8
PZ114J1      pGEX-5X=SAP VEGF exon 6,7,8                   FPSV7


Details regarding these constructs are described in co-pending U.S. Application
Serial Nos. 08/213,446 and 08/213,446, International PCT Application WO
53 1 89, and PCT Appln. US 94/ , filed July 27, 1994.
N/A = not applicable

The plasmids, such as PZ1A1, are designated with (i) a PZ number (PZ1),
followed by (ii) a letter (A), and optionally (iii) followed by a number (1). The
key to these designations: (i) the PZ number refers to the construct number; (ii) the
next letter refers to the plasmid into which the construct was cloned: A=pET 11
without the T7 transcription terminator, B=pET 11 with the T7 transcription
terminator, C=pET 13, D=pET 12, E=pET 15b, F=pPL-λ, G=pKK223-3,
H=PRZ1 (pKK223-3 + KanR), I=pBlueBac III, J=PRZ2 (pKK223-3 + Kan +
lacI gene); and (iii) the optional number (or letter) refers to the bacterial strain
(number) or insect host (letter) in which the plasmid was introduced:
1=BL21(DE3), 2=BL21(DE3)+pLysS, 3=HMS174(DE3),
4=HMS174(DE3)+pLysS, 5=N4830(cI857) and 7=NovaBlue.
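
The designation scheme just described can be decoded mechanically. The sketch below is illustrative only: the function name and the regular expression are assumptions, the vector and host tables simply transcribe the key above (with the vector for letter F written as "pPL-lambda"), and an unlisted host code is returned unchanged because the key does not define the insect host letters.

```python
import re

# Vector letters and host numbers transcribed from the key above.
VECTORS = {
    "A": "pET 11 without the T7 transcription terminator",
    "B": "pET 11 with the T7 transcription terminator",
    "C": "pET 13",
    "D": "pET 12",
    "E": "pET 15b",
    "F": "pPL-lambda",
    "G": "pKK223-3",
    "H": "PRZ1",
    "I": "pBlueBac III",
    "J": "PRZ2",
}
HOSTS = {
    "1": "BL21(DE3)",
    "2": "BL21(DE3)+pLysS",
    "3": "HMS174(DE3)",
    "4": "HMS174(DE3)+pLysS",
    "5": "N4830(cI857)",
    "7": "NovaBlue",
}
# Hypothetical pattern: "PZ", construct number, vector letter, optional host code.
DESIGNATION = re.compile(r"PZ(\d+)([A-J])([0-9A-Za-z])?")


def decode(designation):
    """Split a plasmid designation into construct number, vector and host."""
    match = DESIGNATION.fullmatch(designation)
    if match is None:
        raise ValueError(f"not a recognized designation: {designation!r}")
    number, vector, host = match.groups()
    return {
        "construct": int(number),
        "vector": VECTORS[vector],
        # Codes missing from the key are passed through unchanged.
        "host": HOSTS.get(host, host),
    }


print(decode("PZ74F5"))
print(decode("PZ87I"))
```

For example, PZ74F5 decodes to construct 74 cloned into the vector designated F and introduced into host 5, while PZ87I carries no host code.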

A particularly preferred vector for expressing VEGF or VEGF-cytotoxic
agent fusion protein is the pPL-λ inducible expression vector (Pharmacia Biotech, Uppsala,
Sweden). As described above, this vector contains the tightly regulated leftward
promoter of bacteriophage λ, which is controlled by the cI repressor. The promoter is
temperature inducible in a bacterial host, such as N4830-1, which contains the ts
repressor cI857. Upon induction, the protein is expressed and
compartmentalized into inclusion bodies. The inclusion bodies are released by lysing
the cells, such as with lysozyme digestion and sonication. The insoluble fraction,
containing the inclusion bodies, is isolated by centrifugation. The inclusion bodies are
solubilized by a strong denaturant, such as 6 M guanidine-HCl or urea. The proteins are
recovered from the supernatant following centrifugation by dilution into a buffer
containing 100 mM Tris, 10 mM EDTA, 1% monothioglycerol, 0.25 M L-arginine,
pH 9.5. Other equivalent components may be readily substituted as long as the pH is
basic and a reducing agent is present. The dilution is performed slowly and the
mixture stirred for up to 2 hours. Refolding the protein is accomplished by dialysis of
the protein into buffer, such as PBS, pH 8.8.

F. Properties and use of the resulting chemical conjugates and fusion proteins 

The conjugates provided herein can be used in vitro to identify cells,
particularly tumor cells, that express receptors to which the conjugate selectively binds
and which internalize the conjugates. The cells are contacted with the conjugates and
assayed for proliferation. Cells in which proliferation is inhibited express VEGF
receptors. If such cells are derived from a tumor, such tumor will be a candidate for
treatment with the VEGF conjugate. If such cells are a cell line, such cell line will be
useful in drug screening assays for identification of compounds that modulate the
activity of VEGF receptors (see, e.g., U.S. Patent Nos. 5,208,145, 5,071,773,
4,981,784, 4,603,106, which describe such assays for other receptors).

G. Formulation and administration of pharmaceutical compositions

The conjugates herein may be formulated into pharmaceutical
compositions suitable for topical, local, intravenous and systemic application. Effective
concentrations of one or more of the conjugates are mixed with a suitable
pharmaceutical carrier or vehicle. The concentrations or amounts of the conjugates that
are effective require delivery of an amount, upon administration, that ameliorates the
symptoms or treats the disease. Typically, the compositions are formulated for single
dosage administration. Therapeutically effective concentrations and amounts may be
determined empirically by testing the conjugates in known in vitro and in vivo systems,
such as those described here; dosages for humans or other animals may then be
extrapolated therefrom.

Upon mixing or addition of the conjugate(s) with the vehicle, the
resulting mixture may be a solution, suspension, emulsion or the like. The form of the
resulting mixture depends upon a number of factors, including the intended mode of
administration and the solubility of the conjugate in the selected carrier or vehicle. The
effective concentration is sufficient for ameliorating the symptoms of the disease,
disorder or condition treated and may be empirically determined based upon in vitro
and/or in vivo data, such as the data from the mouse xenograft model for tumors or
rabbit ophthalmic model. If necessary, pharmaceutically acceptable salts or other
derivatives of the conjugates may be prepared.

Pharmaceutical carriers or vehicles suitable for administration of the
conjugates provided herein include any such carriers known to those skilled in the art to
be suitable for the particular mode of administration. As used herein, pharmaceutically
acceptable salts, esters or other derivatives of the conjugates include any salts, esters or
derivatives that may be readily prepared by those of skill in this art using known
methods for such derivatization and that produce compounds that may be administered
to animals or humans without substantial toxic effects and that either are
pharmaceutically active or are prodrugs. A prodrug is a compound that, upon in vivo
administration, is metabolized or otherwise converted to the biologically,
pharmaceutically or therapeutically active form of the compound. To produce a
prodrug, the pharmaceutically active compound is modified such that the active
compound will be regenerated by metabolic processes. The prodrug may be designed
to alter the metabolic stability or the transport characteristics of a drug, to mask side
effects or toxicity, to improve the flavor of a drug or to alter other characteristics or
properties of a drug. By virtue of knowledge of pharmacodynamic processes and drug
metabolism in vivo, those of skill in this art, once a pharmaceutically active compound
is known, can design prodrugs of the compound (see, e.g., Nogrady (1985) Medicinal
Chemistry: A Biochemical Approach, Oxford University Press, New York, pages 388-
392). In addition, the conjugates may be formulated as the sole pharmaceutically active
ingredient in the composition or may be combined with other active ingredients.

The conjugates can be administered by any appropriate route, for
example, orally, parenterally, intravenously, intradermally, subcutaneously, or
topically, in liquid, semi-liquid or solid form and are formulated in a manner suitable
for each route of administration. Preferred modes of administration depend upon the
indication treated. Dermatological and ophthalmologic indications will typically be
treated locally, whereas tumors and vascular proliferative disorders will typically be
treated by systemic, intradermal, intralesional, or intramuscular modes of
administration.

The conjugate is included in the pharmaceutically acceptable carrier in
an amount sufficient to exert a therapeutically useful effect in the absence of
undesirable side effects on the patient treated. It is understood that the number and degree
of side effects depend upon the condition for which the conjugates are administered.
For example, certain toxic and undesirable side effects are tolerated when treating
life-threatening illnesses, such as tumors, that would not be tolerated when treating
disorders of lesser consequence.

The concentration of conjugate in the composition will depend on
absorption, inactivation and excretion rates thereof, the dosage schedule, and amount
administered as well as other factors known to those of skill in the art.

As used herein, an effective amount of a compound for treating a
particular disease is an amount that is sufficient to ameliorate, or in some manner
reduce, the symptoms associated with the disease. Such amount may be administered as
a single dosage or may be administered according to a regimen, whereby it is effective.
The amount may cure the disease but, typically, is administered in order to ameliorate
the symptoms of the disease. Repeated administration may be required to achieve the
desired amelioration of symptoms.

Typically a therapeutically effective dosage should produce a serum
concentration of active ingredient of from about 0.1 ng/ml to about 50-100 μg/ml. The
pharmaceutical compositions typically should provide a dosage of from about 0.01 mg
to about 100 - 2000 mg of conjugate, depending upon the conjugate selected, per
kilogram of body weight per day. Typically, for intravenous or systemic treatment a
daily dosage of between about 0.05 and 0.5 mg/kg should be sufficient. Local
application for ophthalmic disorders should provide about 1 ng up to 100 μg, preferably
about 1 μg to about 10 μg, per single dosage administration. It is understood that the
amount to administer will be a function of the conjugate selected, the indication treated,
and possibly the side effects that will be tolerated. Dosages can be empirically
determined using recognized models for each disorder.
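
The weight-based range stated above is simple arithmetic, sketched here purely as an illustration. The function name and the 70 kg body weight are hypothetical; the default limits are the 0.05 and 0.5 mg/kg figures given above for intravenous or systemic treatment, and the result is exemplary only, in keeping with the empirical determination of dosages described herein.

```python
def daily_dose_range_mg(body_weight_kg, low_mg_per_kg=0.05, high_mg_per_kg=0.5):
    """Convert a per-kilogram daily dosage range into total milligrams per day."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg)


# Hypothetical 70 kg subject, used only to show the arithmetic.
low, high = daily_dose_range_mg(70)
print(f"{low:.1f} mg to {high:.1f} mg per day")
```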

The active ingredient may be administered at once, or may be divided
into a number of smaller doses to be administered at intervals of time. It is understood
that the precise dosage and duration of treatment is a function of the disease being
treated and may be determined empirically using known testing protocols or by
extrapolation from in vivo or in vitro test data. It is to be noted that concentrations and
dosage values may also vary with the severity of the condition to be alleviated. It is to
be further understood that for any particular subject, specific dosage regimens should
be adjusted over time according to the individual need and the professional judgment of
the person administering or supervising the administration of the compositions, and that
the concentration ranges set forth herein are exemplary only and are not intended to
limit the scope or practice of the claimed compositions.

Solutions or suspensions used for parenteral, intradermal, subcutaneous,
or topical application can include any of the following components: a sterile diluent,
such as water for injection, saline solution, fixed oil, polyethylene glycol, glycerine,
propylene glycol or other synthetic solvent; antimicrobial agents, such as benzyl
alcohol and methyl parabens; antioxidants, such as ascorbic acid and sodium bisulfite;
chelating agents, such as ethylenediaminetetraacetic acid (EDTA); buffers, such as
acetates, citrates and phosphates; and agents for the adjustment of tonicity such as
sodium chloride or dextrose. Parenteral preparations can be enclosed in ampules,
disposable syringes or multiple dose vials made of glass, plastic or other suitable
material. If administered intravenously, suitable carriers include physiological saline or
phosphate buffered saline (PBS), and solutions containing thickening and solubilizing
agents, such as glucose, polyethylene glycol, and polypropylene glycol and mixtures
thereof. Liposomal suspensions may also be suitable as pharmaceutically acceptable
carriers. These may be prepared according to methods known to those skilled in the art.

The conjugates may be prepared with carriers that protect them against
rapid elimination from the body, such as time release formulations or coatings. Such
carriers include controlled release formulations, such as, but not limited to, implants
and microencapsulated delivery systems, and biodegradable, biocompatible polymers,
such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, polyorthoesters,
polylactic acid and others. These are particularly useful for application to the eye for
ophthalmic indications following or during surgery in which only a single
administration is possible. Methods for preparation of such formulations are known to
those skilled in the art.

The conjugates may be formulated for local or topical application, such
as for topical application to the skin and mucous membranes, such as in the eye, in the
form of gels, creams, and lotions and for application to the eye or for intracisternal or
intraspinal application. Such solutions, particularly those intended for ophthalmic use,
may be formulated as 0.01%-10% isotonic solutions, pH about 5-7, with appropriate
salts. The ophthalmic compositions may also include additional components, such as
hyaluronic acid. The conjugates may be formulated as aerosols for topical application
(see, e.g., U.S. Patent Nos. 4,044,126, 4,414,209, and 4,364,923).

If oral administration is desired, the conjugate should be provided in a
composition that protects it from the acidic environment of the stomach. For example,
the composition can be formulated in an enteric coating that maintains its integrity in
the stomach and releases the active compound in the intestine. The composition may
also be formulated in combination with an antacid or other such ingredient.

Oral compositions will generally include an inert diluent or an edible
carrier and may be compressed into tablets or enclosed in gelatin capsules. For the
purpose of oral therapeutic administration, the active compound or compounds can be
incorporated with excipients and used in the form of tablets, capsules or troches.
Pharmaceutically compatible binding agents and adjuvant materials can be included as
part of the composition. When the dosage unit form is a capsule, it can contain, in
addition to material of the above type, a liquid carrier such as a fatty oil. In addition,
dosage unit forms can contain various other materials which modify the physical form
of the dosage unit, for example, coatings of sugar and other enteric agents. The
conjugates can also be administered as a component of an elixir, suspension, syrup,
wafer, chewing gum or the like. A syrup may contain, in addition to the active
compounds, sucrose as a sweetening agent and certain preservatives, dyes and colorings
and flavors.

The active materials can also be mixed with other active materials that
do not impair the desired action, or with materials that supplement the desired action,
such as cis-platin for treatment of tumors.

Finally, the compounds may be packaged as articles of manufacture
containing packaging material, one or more conjugates or compositions as provided
herein within the packaging material, and a label that indicates the indication for which
the conjugate is provided.

H. Therapeutic use of the VEGF conjugates

The conjugates provided herein can be used in pharmaceutical
compositions to treat VEGF-mediated pathophysiological conditions by targeting to
cells that bear VEGF receptors and inhibiting proliferation of or causing death of the
cells. Such pathophysiological conditions include, for example, certain tumors, such as
Kaposi's sarcoma, renal cell carcinomas and highly vascularized tumors, rheumatoid
arthritis, psoriasis and other hyperproliferative skin disorders. As used herein, a
hyperproliferative skin disorder is a disorder that is manifested by a proliferation of
endothelial cells of the skin coupled with an underlying vascular proliferation, resulting
in a localized patch of scaly, horny or thickened skin or a tumor of endothelial origin.
Such disorders include, but are not limited to, actinic and atopic dermatitis, toxic
eczema, allergic eczema, psoriasis, skin cancers and other tumors, such as Kaposi's
sarcoma, angiosarcoma, hemangiomas, and other highly vascularized tumors, and
vascular proliferative responses, such as varicose veins. The treatment is effected by
administering a therapeutically effective amount of the VEGF conjugate, for example,
in a physiologically acceptable vehicle suitable for local or systemic application. In
particular, for treatment of localized skin disorders the conjugate is formulated for
topical, local or intralesional application to the skin and is applied topically, locally or
intralesionally.

Treatment means any manner in which the symptoms of a condition,
disorder or disease are ameliorated or otherwise beneficially altered. Treatment also
encompasses any pharmaceutical use of the compositions herein. Amelioration of the
symptoms of a particular disorder by administration of a particular pharmaceutical
composition refers to any lessening, whether permanent or temporary, lasting or
transient, that can be attributed to or associated with administration of the composition.

The following examples are included for illustrative purposes only and
are not intended to limit the scope of the invention.



EXAMPLE 1

Recombinant Production of Saporin

A. Materials and methods
1. Bacterial Strains

E. coli strain JA221 (lpp- hsdM+ trpE5 leuB6 lacY recA1 F'[lacIq lac+
pro+]) is publicly available from the American Type Culture Collection (ATCC),
Rockville, MD 20852, under the accession number ATCC 33875. (JA221 is also
available from the Northern Regional Research Center (NRRL), Agricultural Research
Service, U.S. Department of Agriculture, Peoria, IL 61604, under the accession number
NRRL B-15211; see, also, U.S. Patent No. 4,757,013 to Inouye; and Nakamura et al.,
Cell 18:1109-1117, 1979). Strain INV1α is commercially available from Invitrogen,
San Diego, CA.



2. DNA Manipulations

The restriction and modification enzymes employed herein are
commercially available in the U.S. Native saporin and rabbit polyclonal antiserum to
saporin were obtained as previously described in Lappi et al., Biochem. Biophys. Res.
Comm. 129:934-942. Ricin A chain is commercially available from Sigma,
Milwaukee, WI. Antiserum was linked to Affi-gel 10 (Bio-Rad, Emeryville, CA)
according to the manufacturer's instructions. Sequencing was performed using the
Sequenase kit of United States Biochemical Corporation (version 2.0) according to the
manufacturer's instructions. Minipreparation and maxipreparation of plasmids,
preparation of competent cells, transformation, M13 manipulation, bacterial media,
Western blotting, and ELISA assays were according to Sambrook et al. (Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, NY, 1989). The purification of DNA fragments was done using the Geneclean
II kit (Bio 101) according to the manufacturer's instructions. SDS gel electrophoresis
was performed on a Phastsystem (Pharmacia).

Western blotting was accomplished by transfer of the electrophoresed
protein to nitrocellulose using the PhastTransfer system, as described by the
manufacturer. The antiserum to SAP was used at a dilution of 1:1000. Horseradish
peroxidase labeled anti-IgG was used as the second antibody (see Davis et al., Basic
Methods In Molecular Biology, New York, Elsevier Science Publishing Co., pp 1-338,
1986).

B. Isolation of DNA encoding saporin 
5 1. Isolation of genomic DNA and preparation of polymerase chain 

reaction (PCR) primers 

Saponaria officinalis leaf genomic DNA was prepared as described in
Bianchi et al., Plant Mol. Biol. 11:203-214, 1988. Primers for genomic DNA
amplifications were synthesized in a 380B automatic DNA synthesizer. The primer
corresponding to the "sense" strand of saporin (SEQ ID NO. 1) includes an EcoR I
restriction site adapter immediately upstream of the DNA codon for amino acid -15 of
the native saporin N-terminal leader sequence (SEQ ID NO. 1):

5'-CTGCAGAATTCGCATGGATCCTGCTTCAAT-3'.

The primer 5'-CTGCAGAATTCGCCTCGTTTGACTACTTTG-3' (SEQ
ID NO. 2) corresponds to the "antisense" strand of saporin and complements the coding
sequence of saporin starting from the last 5 nucleotides of the DNA encoding the
carboxyl end of the mature peptide. Use of this primer introduced a translation stop
codon and an EcoRI restriction site after the sequence encoding mature saporin.
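
As a quick consistency check on the two primers above, the following minimal Python
sketch locates the EcoRI recognition sequence (GAATTC) in each primer and
reverse-complements the antisense primer to show the stop codon it introduces. The
helper names are illustrative assumptions and are not part of the original disclosure.

```python
# Illustrative check of the PCR primers (SEQ ID NO. 1 and SEQ ID NO. 2).
# All helper names are assumptions made for this sketch.

ECORI_SITE = "GAATTC"  # EcoRI recognition sequence

SENSE_PRIMER = "CTGCAGAATTCGCATGGATCCTGCTTCAAT"      # SEQ ID NO. 1
ANTISENSE_PRIMER = "CTGCAGAATTCGCCTCGTTTGACTACTTTG"  # SEQ ID NO. 2

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_site(seq: str, site: str = ECORI_SITE) -> int:
    """Return the 0-based index of the first occurrence of site, or -1."""
    return seq.find(site)


# Coding-strand view of the region covered by the antisense primer.
coding_view = reverse_complement(ANTISENSE_PRIMER)

if __name__ == "__main__":
    print("EcoRI in sense primer at index", find_site(SENSE_PRIMER))
    print("EcoRI in antisense primer at index", find_site(ANTISENSE_PRIMER))
    print("coding-strand view:", coding_view)
    # The first 5 nt are the end of the mature coding sequence; the next
    # 3 nt are the introduced stop codon.
    print("stop codon:", coding_view[5:8])
```

Read on the coding strand, the antisense primer gives the last five nucleotides of the
mature coding sequence, then a TAG stop codon, then the EcoRI site.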

2. Amplification of DNA encoding saporin

Unfractionated Saponaria officinalis leaf genomic DNA (1 µl) was
mixed in a final volume of 100 µl containing 10 mM Tris-HCl (pH 8.3), 50 mM KCl,
0.01% gelatin, 2 mM MgCl2, 0.2 mM dNTPs, 0.8 µg of each primer. Next, 2.5 U Taq
DNA polymerase (Perkin Elmer Cetus) was added and the mixture was overlaid with
30 µl of mineral oil (Sigma). Incubations were done in a DNA Thermal Cycler
(Ericomp). One cycle included a denaturation step (94°C for 1 min.), an annealing step
(60°C for 2 min.), and an elongation step (72°C for 3 min.). After 30 cycles, a 10 µl
aliquot of each reaction was run on a 1.5% agarose gel to verify the correct structure of
the amplified product.
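
The cycling program above can be summarized with a small amount of arithmetic. The
sketch below is only an illustration: it adds up the hold times stated in the text and
ignores ramp times between temperatures, which the text does not give. The class and
function names are assumptions.

```python
# Illustrative arithmetic for the thermal cycling program described above.
# Ramp times between temperatures are ignored (an assumption of this sketch).

from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    minutes: float


CYCLE = (
    Step("denaturation", 94, 1.0),
    Step("annealing", 60, 2.0),
    Step("elongation", 72, 3.0),
)
N_CYCLES = 30


def minutes_per_cycle(steps=CYCLE) -> float:
    """Sum of the hold times in one cycle."""
    return sum(step.minutes for step in steps)


def total_hold_minutes(steps=CYCLE, n_cycles: int = N_CYCLES) -> float:
    """Total hold time over all cycles."""
    return minutes_per_cycle(steps) * n_cycles


if __name__ == "__main__":
    print(minutes_per_cycle(), "min per cycle")
    print(total_hold_minutes() / 60, "h total hold time")
```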

The amplified DNA was digested with EcoRI and subcloned into EcoRI-restricted
M13mp18 (see, Yanisch-Perron et al. (1985), Gene 33:103). Single-
stranded DNA from recombinant phages was sequenced using oligonucleotides based
on internal points in the coding sequence of saporin (see, Bennati et al., Eur. J.
Biochem. 183:465-470, 1989). Nine of the M13mp18 derivatives were sequenced and
compared. Of the nine sequenced clones, five had unique sequences, set forth as SEQ
ID NOS. 3-7, respectively. The clones were designated M13mp18-G4, -G1, -G2, -G7,
and -G9. Each of these clones contains all of the saporin coding sequence and 45
nucleotides of DNA encoding the native saporin N-terminal leader peptide.

C. pOMPAG4 Plasmid Construction

M13mp18-G4, containing the SEQ ID NO. 3 clone from Example
1.B.2., was digested with EcoR I, and the resulting fragment was ligated into the EcoR
I site of the vector pIN-IIIompA2 (see, e.g., U.S. Patent No. 4,757,013 to Inouye;
and Duffaud et al., Meth. Enz. 153:492-507, 1987) using the methods described in
Example 1.A.2. The ligation was accomplished such that the DNA encoding saporin,
including the N-terminal extension, was fused to the leader peptide segment of the
bacterial ompA gene. The resulting plasmid pOMPAG4 contains the lpp promoter
(Nakamura et al., Cell 18:1109-1117, 1979), the E. coli lac promoter operator sequence
(lac O) and the E. coli ompA gene secretion signal in operative association with each
other and with the saporin and native N-terminal leader-encoding DNA listed in SEQ
ID NO. 3. The plasmid also includes the E. coli lac repressor gene (lac I).

The M13mp18-G1, -G2, -G7, and -G9 clones obtained from Example
1.B.2, containing SEQ ID NOS. 4-7 respectively, are digested with EcoR I and ligated
into EcoR I digested pIN-IIIompA2 as described for M13mp18-G4 above in this
example. The resulting plasmids, labeled pOMPAG1, pOMPAG2, pOMPAG7, and
pOMPAG9, are screened, expressed, purified, and characterized as described for the
plasmid pOMPAG4.

INV1α competent cells were transformed with pOMPAG4 and cultures
containing the desired plasmid structure were grown further in order to obtain a large
preparation of isolated pOMPAG4 plasmid using methods described in Example 1.A.2.

D. Saporin expression in E. coli

The pOMPAG4 transformed E. coli cells were grown under conditions
in which the expression of the saporin-containing protein is repressed by the lac
repressor until the end of log phase of growth, after which IPTG was added to induce
expression of the saporin-encoding DNA.

To generate a large-batch culture of pOMPAG4 transformed E. coli
cells, an overnight culture (lasting approximately 16 hours) of JA221 E. coli cells
transformed with the plasmid pOMPAG4 in LB broth (see, e.g., Sambrook et al., supra)
containing 125 mg/ml ampicillin was diluted 1:100 into a flask containing 750 ml LB
broth with 125 mg/ml ampicillin. Cells were grown at logarithmic phase shaking at
37°C until the optical density at 550 nm reached 0.9 measured in a spectrophotometer.

In the second step, saporin expression was induced by the addition of
IPTG (Sigma) to a final concentration of 0.2 mM. Induced cultures were grown for 2
additional hours and then harvested by centrifugation (25 min., 6500 x g). The cell
pellet was resuspended in ice cold 1.0 M TRIS, pH 9.0, 2 mM EDTA (10 ml were
added to each gram of pellet). The resuspended material was kept on ice for 20-60
minutes and then centrifuged (20 min., 6500 x g) to separate the periplasmic fraction of
E. coli, which corresponds to the supernatant, from the intracellular fraction
corresponding to the pellet.
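
The volumes implied by the culture and fractionation steps above follow from simple
proportions. The sketch below is a hedged illustration only: the 1:100 dilution, the
750 ml culture volume, the 0.2 mM final IPTG concentration and the 10 ml of buffer per
gram of pellet come from the text, while the IPTG stock concentration and the pellet
mass used in the example call are hypothetical values chosen for the illustration.

```python
# Illustrative bench arithmetic for the induction and fractionation steps.
# The IPTG stock concentration and pellet mass passed in below are
# hypothetical; they do not appear in the text.

CULTURE_ML = 750.0       # LB volume in the flask
DILUTION_FACTOR = 100    # overnight culture diluted 1:100
IPTG_FINAL_MM = 0.2      # final IPTG concentration
BUFFER_ML_PER_G = 10.0   # resuspension buffer per gram of pellet


def inoculum_ml(culture_ml: float = CULTURE_ML, factor: int = DILUTION_FACTOR) -> float:
    """Approximate volume of overnight culture for a 1:factor dilution."""
    return culture_ml / factor


def iptg_stock_ml(stock_mm: float, culture_ml: float = CULTURE_ML,
                  final_mm: float = IPTG_FINAL_MM) -> float:
    """Volume of IPTG stock giving final_mm (added volume neglected)."""
    return final_mm * culture_ml / stock_mm


def resuspension_ml(pellet_g: float, ml_per_g: float = BUFFER_ML_PER_G) -> float:
    """Volume of TRIS/EDTA buffer for a pellet of the given wet mass."""
    return pellet_g * ml_per_g


if __name__ == "__main__":
    print("inoculum (ml):", inoculum_ml())
    print("IPTG stock (ml), assuming a 100 mM stock:", iptg_stock_ml(100.0))
    print("buffer (ml) for a hypothetical 3 g pellet:", resuspension_ml(3.0))
```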

E. Purification of secreted recombinant saporin

1. Anti-SAP immuno-affinity purification

The periplasmic fraction from Example 1.D. was dialyzed against
borate-buffered saline (BBS: 5 mM boric acid, 1.25 mM borax, 145 mM sodium
chloride, pH 8.5). The dialysate was loaded onto an immunoaffinity column
(0.5 x 2 cm) of anti-saporin antibodies, obtained as described in Lappi et al., Biochem.
Biophys. Res. Comm. 129:934-942, 1985, bound to Affi-gel 10 and equilibrated in
BBS at a flow rate of about 0.5 ml/min. The column was washed with BBS until the
absorbance at 280 nm of the flow-through was reduced to baseline. Next the column
containing the antibody bound saporin was eluted with 1.0 M acetic acid and 0.5 ml
fractions were collected in tubes containing 0.3 ml of 2 M ammonium hydroxide,
pH 10. The fractions were analyzed by ELISA (see, e.g., Sambrook et al., supra). The
peak fraction of the ELISA was analyzed by Western blotting as described in Example
1.A.2 and showed a single band with a slightly higher molecular weight than native
saporin. The fractions that contained saporin protein, as determined by the ELISA,
were then pooled for further purification.
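
The purpose of collecting each acid fraction into ammonium hydroxide is rapid
neutralization. A rough estimate of the resulting pH can be sketched with the
Henderson-Hasselbalch relation. This is an approximation under stated assumptions: the
fraction is treated as 0.5 ml of 1.0 M acetic acid, the acid is assumed to convert an
equivalent amount of ammonia to ammonium, buffering by protein and activity effects are
ignored, and the pKa used for ammonium (9.25) is a textbook value rather than a figure
from the text.

```python
# Rough pH estimate for an eluted fraction collected into ammonium hydroxide.
# Assumptions (not from the text): fraction is 1.0 M acetic acid, ideal
# behavior, pKa(NH4+) = 9.25, no buffering contribution from protein.

import math

PKA_AMMONIUM = 9.25


def neutralized_ph(acid_ml: float, acid_molar: float,
                   base_ml: float, base_molar: float,
                   pka: float = PKA_AMMONIUM) -> float:
    """Estimate pH of an NH3/NH4+ buffer formed when acid meets excess NH3."""
    acid_mmol = acid_ml * acid_molar
    base_mmol = base_ml * base_molar
    if base_mmol <= acid_mmol:
        raise ValueError("base is not in excess; buffer formula does not apply")
    ammonium_mmol = acid_mmol              # NH3 protonated by the acid
    ammonia_mmol = base_mmol - acid_mmol   # NH3 left over
    return pka + math.log10(ammonia_mmol / ammonium_mmol)


if __name__ == "__main__":
    # 0.5 ml fraction into 0.3 ml of 2 M ammonium hydroxide
    print(round(neutralized_ph(0.5, 1.0, 0.3, 2.0), 2))
```

Under these assumptions the base is in slight molar excess (0.6 mmol against 0.5 mmol),
so the collected fraction ends up mildly alkaline rather than acidic.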

2. Reverse phase high performance liquid chromatography
purification

To further purify the saporin secreted into the periplasm, the pooled
fractions from Example 1.E.1. were diluted 1:1 with 0.1% trifluoroacetic acid (TFA) in
water and chromatographed in reverse phase high pressure liquid chromatography
(HPLC) on a Vydac C4 column (Western Analytical) equilibrated in 20% acetonitrile,
0.1% TFA in water. The protein was eluted with a 20 minute gradient to 60%
acetonitrile. The HPLC produced a single peak that was the only area of
immunoreactivity with anti-SAP antiserum when analyzed by a western blot as
described in Example 1.E.1. Samples were assayed by an ELISA.

Sequence analysis was performed by Edman degradation in a gas-phase
sequenator (Applied Biosystems) (see, e.g., Lappi et al., Biochem. Biophys. Res.
Comm. 129:934-942, 1985). The results indicated that five polypeptides were obtained
that differ in the length, between 7 and 12 amino acids, of the N-terminal saporin leader
before the initial amino acid valine of the mature native saporin (SEQ ID NO. 3: residues
-12 through -7). All of the N-terminal extended variants retained cytotoxic activity.
The size of the native leader is 18 residues, indicating that the native signal peptide is
not properly processed by bacterial processing enzymes. The ompA signal was,
however, properly processed.

To obtain homogeneous saporin, the recombinantly produced saporin
can be separated by size.

F. Purification of intracellular soluble saporin

To purify the cytosolic soluble saporin protein, the pellet from the
intracellular fraction of Example 1.E. above was resuspended in lysis buffer (30 mM
TRIS, 2 mM EDTA, 0.1% Triton X-100, pH 8.0, with 1 mM PMSF, 10 µg/ml
pepstatin A, 10 µg aprotinin, 10 µg/ml leupeptin and 100 µg/ml lysozyme, 3.5 ml per
gram of original pellet). To lyse the cells, the suspension was left at room temperature
for one hour, then frozen in liquid nitrogen and thawed in a 37°C bath three times, and
then sonicated for two minutes. The lysate was centrifuged at 11,500 x g for 30 min.
The supernatant was removed and stored, the pellet was resuspended in an equal
volume of lysis buffer, centrifuged as before, and this second supernatant was
combined with the first. The pooled supernatants were dialyzed versus BBS and
chromatographed over the immunoaffinity column as described in Example 1.E.1. This
material also retained cytotoxic activity.

G. Assay for cytotoxic activity

The RIP activity of recombinant saporin was compared to the RIP
activity of native SAP in an in vitro assay measuring cell-free protein synthesis in a
nuclease-treated rabbit reticulocyte lysate (Promega). Samples of immunoaffinity-
purified saporin, obtained in Example 1.E.1., were diluted in PBS and 5 µl of sample
was added on ice to 35 µl of rabbit reticulocyte lysate and 10 µl of a reaction mixture
containing 0.5 µl of Brome Mosaic Virus RNA, 1 mM amino acid mixture minus
leucine, 5 µCi of tritiated leucine and 3 µl of water. Assay tubes were incubated 1 hour
in a 30°C water bath. The reaction was stopped by transferring the tubes to ice and
adding 5 µl of the assay mixture, in triplicate, to 75 µl of 1 N sodium hydroxide, 2.5%
hydrogen peroxide in the wells of a Millititer HA 96-well filtration plate (Millipore).
When the red color had bleached from the samples, 300 µl of ice cold 25%
trichloroacetic acid (TCA) were added to each well and the plate left on ice for another
30 min. Vacuum filtration was performed with a Millipore vacuum holder. The wells
were washed three times with 300 µl of ice cold 8% TCA. After drying, the filter paper
circles were punched out of the 96-well plate and counted by liquid scintillation
techniques.

The IC50 for the recombinant and native saporin were approximately 20
pM. Therefore, recombinant saporin-containing protein has full protein synthesis
inhibition activity when compared to native saporin.
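
The text does not say how the IC50 values were derived from the scintillation counts.
One common approach, shown here purely as a sketch, is to express incorporation as a
percentage of the untreated control and interpolate linearly on log concentration
between the two points that bracket 50%. The function name and the example data below
are hypothetical and are not results from this work.

```python
# Minimal IC50 estimate by log-linear interpolation.
# The example data are hypothetical and serve only to exercise the function.

import math


def ic50_log_interpolation(concs, responses):
    """Estimate IC50 from increasing concentrations and % of control values.

    Interpolates linearly in log10(concentration) between the first pair of
    adjacent points whose responses bracket 50%.
    """
    points = list(zip(concs, responses))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        if r_lo >= 50.0 >= r_hi and r_lo != r_hi:
            frac = (r_lo - 50.0) / (r_lo - r_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    raise ValueError("responses do not bracket 50%")


# Hypothetical dose-response points (concentration in pM, % of control).
example_pm = [1.0, 10.0, 100.0]
example_pct = [90.0, 70.0, 20.0]

if __name__ == "__main__":
    print(ic50_log_interpolation(example_pm, example_pct), "pM (hypothetical data)")
```

A fitted four-parameter logistic curve would be a reasonable alternative when more
concentrations are available; interpolation is shown only because it is easy to verify
by hand.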

EXAMPLE 2 

Recombinant Production of FGF-SAP Fusion Protein 

A. General Descriptions

1. Bacterial Strains and Plasmids:

E. coli strains BL21(DE3), BL21(DE3)pLysS, HMS174(DE3) and
HMS174(DE3)pLysS were purchased from Novagen, Madison, WI. Plasmid pFC80,
described below, has been described in the PCT Application No. WO 90/02800, except
that the bFGF coding sequence in the plasmid designated pFC80 herein has the
sequence set forth as SEQ ID NO. 12, nucleotides 1-465. The plasmids described
herein may be prepared using pFC80 as a starting material or, alternatively, by starting
with a fragment containing the cII ribosome binding site (SEQ ID NO. 15) linked to the
FGF-encoding DNA (SEQ ID NO. 12).

E. coli strain JA221 (lpp- hdsM+ trpE5 leuB6 lacY recA1 F'[lacIq lac+
pro+]) is publicly available from the American Type Culture Collection (ATCC),
Rockville, MD 20852, under the accession number ATCC 33875. (JA221 is also
available from the Northern Regional Research Center (NRRL), Agricultural Research
Service, U.S. Department of Agriculture, Peoria, IL 61604, under the accession number
NRRL B-15211; see, also, U.S. Patent No. 4,757,013 to Inouye; and Nakamura et al.,
Cell 18:1109-1117, 1979). Strain INV1α is commercially available from Invitrogen,
San Diego, CA.

2. DNA Manipulations 

Native SAP, chemically conjugated bFGF-SAP and rabbit polyclonal
antiserum to SAP and FGF were obtained as described in Lappi et al., Biochem.
Biophys. Res. Comm. 129:934-942, 1985, and Lappi et al., Biochem. Biophys. Res.
Comm. 160:917-923, 1989. The pET System Induction Control was purchased from
Novagen, Madison, WI. The sequencing of the different constructions was done using
the Sequenase kit of United States Biochemical Corporation (version 2.0).
Minipreparation and maxipreparations of plasmids, preparation of competent cells,
transformation, M13 manipulation, bacterial media and Western blotting were
performed using routine methods (see, e.g., Sambrook et al., supra). The purification of
DNA fragments was done using the Geneclean II kit, purchased from Bio 101. SDS gel
electrophoresis was performed on a Phastsystem (Pharmacia).

Western blotting was accomplished by transfer of the electrophoresed
protein to nitrocellulose using the PhastTransfer system, as described by the
manufacturer. Horseradish peroxidase labeled anti-IgG was used as the second
antibody (see Davis et al., Basic Methods In Molecular Biology, New York, Elsevier
Science Publishing Co., pp. 1-338, 1986).



B. Construction of plasmids encoding FGF-SAP fusion proteins 

1. Construction of FGFM13 that contains DNA encoding the cII
ribosome binding site linked to FGF

A Nco I restriction site was introduced into the SAP-encoding DNA of the
M13mp18-G4 clone, prepared as described in Example 1.B.2., by site-directed
mutagenesis using the Amersham in vitro-mutagenesis system 2.1. The
oligonucleotide employed to create the Nco I restriction site was synthesized using a
380B automatic DNA synthesizer (Applied Biosystems) and is listed as:

SEQ ID NO. 8 - CAACAACTGCCATGGTCACATC.

This oligonucleotide containing the Nco I site replaced the original SAP-
containing coding sequence at SEQ ID NO. 3, nts 32-53. The resulting M13mp18-G4
derivative is termed mpNG4.
In order to produce a bFGF coding sequence in which the stop codon
was removed, the FGF-encoding DNA was subcloned into a M13 phage and subjected
to site-directed mutagenesis. Plasmid pFC80 is a derivative of pDS20 (see, e.g.,
Duester et al., Cell 30:855-864, 1982; see, also, U.S. Patent Nos. 4,914,027, 5,037,744,
5,100,784, and 5,187,261; see, also, PCT Application No. WO 90/02800; and European
Patent Application No. EP 267703 A1), which is almost the same as plasmid pKG1800
(see, Bernardi et al., DNA Sequence 1:147-150, 1990; see, also, McKenney et al. (1981)
pp. 383-415 in Gene Amplification and Analysis 2: Analysis of Nucleic Acids by
Enzymatic Methods, Chirikjian et al. (eds.), North Holland Publishing Company,
Amsterdam) except that it contains an extra 440 bp at the distal end of galK between
nucleotides 2440 and 2880 in pDS20. Plasmid pKG1800 includes the 2880 bp EcoR I-
Pvu II fragment of pBR322 that contains the ampicillin resistance gene and an origin
of replication.

Plasmid pFC80 was prepared from pDS20 by replacing the entire galK
gene with the FGF-encoding DNA of SEQ ID NO. 12, inserting the trp promoter (SEQ
ID NO. 14) and the bacteriophage lambda cII ribosome binding site (SEQ ID NO. 15;
see, e.g., Schwarz et al., Nature 272:410, 1978) upstream of and operatively linked to
the FGF-encoding DNA. The trp promoter can be obtained from plasmid pDR720
(Pharmacia PL Biochemicals) or synthesized according to SEQ ID NO. 14. Plasmid
pFC80 contains the 2880 bp EcoRI-BamHI fragment of plasmid pDS20, a synthetic
Sal I-Nde I fragment that encodes the Trp promoter region (SEQ ID NO. 14):

EcoRI
AATTCCCCTGTTGACAATTAATCATCGAACTAGTTAACTAGTACGCAGCTTGGCTGCAG

and the cII ribosome binding site (SEQ ID NO. 15):

SalI                                                   NdeI
GTCGACCAAGCTTGGGCATACATTCAATCAATTGTTATCTAAGGAAATACTTACATATG

The FGF-encoding DNA was removed from pFC80 by treating it as
follows. The pFC80 plasmid was digested by HgaI and SalI, which produces a
fragment containing the cII ribosome binding site linked to the FGF-encoding DNA.
The resulting fragment was blunt ended with Pol I (Klenow's fragment) and inserted into
M13mp18 that had been opened by SmaI and treated with alkaline phosphatase for
blunt-end ligation. In order to remove the stop codon, an insert in the ORI minus
direction was mutagenized, as described above, using the following oligonucleotide
(SEQ ID NO. 9): GCTAAGAGCGCCATGGAGA. SEQ ID NO. 9 contains one
nucleotide between the FGF carboxy terminal serine codon and a Nco I restriction site; it
replaced the following wild type FGF encoding DNA having SEQ ID NO. 10:

GCT AAG AGC TGA  CCA TGG AGA
Ala Lys Ser STOP Pro Trp Arg
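
The effect of the mutagenic oligonucleotide can be illustrated by translating the wild
type and mutant sequences in the reading frame shown above. The sketch below uses only
the codons that occur in these two short sequences. The function and constant names are
assumptions introduced for the illustration.

```python
# Translate the wild type (SEQ ID NO. 10) and mutant (SEQ ID NO. 9)
# sequences in the bFGF reading frame. Only the codons needed here are listed.

CODONS = {
    "GCT": "Ala", "GCC": "Ala", "AAG": "Lys", "AGC": "Ser",
    "TGA": "STOP", "CCA": "Pro", "TGG": "Trp", "AGA": "Arg",
    "ATG": "Met", "GAG": "Glu",
}

WILD_TYPE = "GCTAAGAGCTGACCATGGAGA"  # SEQ ID NO. 10
MUTANT = "GCTAAGAGCGCCATGGAGA"       # SEQ ID NO. 9
NCOI_SITE = "CCATGG"                 # Nco I recognition sequence


def translate(seq: str) -> list:
    """Translate complete codons from the first base; ignore a trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [CODONS[seq[i:i + 3]] for i in range(0, usable, 3)]


if __name__ == "__main__":
    print("wild type:", " ".join(translate(WILD_TYPE)))
    print("mutant:   ", " ".join(translate(MUTANT)))
    print("Nco I site in mutant at index", MUTANT.find(NCOI_SITE))
```

In the mutant, the TGA stop codon after the serine codon is gone and the frame reads
through Ala-Met, the two residues that connect bFGF to saporin in the fusion. The
single G at index 9 is the one nucleotide that separates the serine codon from the
Nco I site.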

The resulting mutant derivative of M13mp18, lacking a native stop
codon after the carboxy terminal serine codon of bFGF, was designated FGFM13. The
mutagenized region of FGFM13 contained the correct sequence (SEQ ID NO. 11).

2. Preparation of plasmids pFS92 (PZ1A), PZ1B and PZ1C that
encode the FGF-SAP fusion protein (FPFS1)

a. Plasmid pFS92 (also designated PZ1A)

Plasmid FGFM13 was cut with Nco I and Sac I to yield a fragment
containing the cII ribosome binding site linked to the bFGF coding sequence with the
stop codon replaced.

The M13mp18 derivative mpNG4 containing the saporin coding
sequence was also cut with restriction endonucleases Nco I and Sac I, and the bFGF
coding fragment from FGFM13 was inserted by ligation, joining it to the saporin-encoding
DNA in the M13mp18 derivative to produce mpFGF-SAP, which
contains the cII ribosome binding site linked to the FGF-SAP fusion gene. The
sequence of the fusion gene is set forth in SEQ ID NO. 12 and indicates that the FGF
protein carboxy terminus and the saporin protein amino terminus are separated by 6
nucleotides (SEQ ID NOS. 12 and 13, nts 466-471) that encode two amino acids, Ala
Met.

Plasmid mpFGF-SAP was digested with Xba I and EcoR I and the
resulting fragment containing the bFGF-SAP coding sequence was isolated and ligated
into plasmid pET-11a (available from Novagen, Madison, WI; for a description of the
plasmids see U.S. Patent No. 4,952,496; see, also, Studier et al., Meth. Enz. 185:60-89,
1990; Studier et al., J. Mol. Biol. 189:113-130, 1986; Rosenberg et al., Gene 56:125-
135, 1987) that had also been treated with EcoR I and Xba I. The resulting plasmid was
designated pFS92. It was renamed PZ1A.

Plasmid pFS92 (or PZ1A) contains DNA encoding the entire basic FGF protein
(SEQ ID NO. 12), a 2-amino acid long connecting peptide, and amino acids 1 to 253 of
the mature SAP protein. Plasmid pFS92 also includes the cII ribosome binding site
linked to the FGF-SAP fusion protein and the T7 promoter region from pET-11a.
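
The stated coordinates allow a simple bookkeeping check of the fusion protein length.
The sketch below assumes that nucleotides 1-465 of SEQ ID NO. 12 are all translated
bFGF codons (no stop codon in that span) and ignores any N-terminal processing; both
points are assumptions of the illustration rather than statements from the text.

```python
# Bookkeeping sketch for the FGF-SAP fusion encoded by pFS92.
# Assumes nts 1-465 are entirely translated bFGF codons and ignores any
# N-terminal processing of the expressed protein.

BFGF_CODING_NT = (1, 465)   # bFGF coding sequence within SEQ ID NO. 12
LINKER_NT = (466, 471)      # nucleotides encoding the Ala-Met connecting peptide
SAP_RESIDUES = 253          # mature SAP amino acids 1 to 253


def n_codons(span) -> int:
    """Number of whole codons in an inclusive nucleotide span."""
    start, end = span
    length = end - start + 1
    if length % 3:
        raise ValueError("span is not a whole number of codons")
    return length // 3


def fusion_length() -> int:
    """Total residues: bFGF + connecting peptide + mature SAP."""
    return n_codons(BFGF_CODING_NT) + n_codons(LINKER_NT) + SAP_RESIDUES


if __name__ == "__main__":
    print("bFGF residues:", n_codons(BFGF_CODING_NT))
    print("linker residues:", n_codons(LINKER_NT))
    print("fusion residues:", fusion_length())
```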

E. coli strain BL21(DE3)pLysS (Novagen, Madison WI) was
transformed with pFS92 according to manufacturer's instructions and the methods
described in Example 2.A.2.

b. Plasmid PZ1B

Plasmid pFS92 was digested with EcoR I, the ends repaired by adding
nucleoside triphosphates and Klenow DNA polymerase, and then digested with Nde I to
release the FGF-encoding DNA without the cII ribosome binding site. This fragment
was ligated into pET 11a, which had been BamH I digested, treated to repair the ends,
and digested with Nde I. The resulting plasmid was designated PZ1B. PZ1B includes
the T7 transcription terminator and the pET-11a ribosome binding site.

E. coli strain BL21(DE3) (Novagen, Madison WI) was transformed with
PZ1B according to manufacturer's instructions and the methods described in Example

c. Plasmid PZ1C

Plasmid PZ1C was prepared from PZ1B by replacing the ampicillin
resistance gene with a kanamycin resistance gene.

d. Plasmid PZ1D

Plasmid pFS92 was digested with EcoR I and Nde I to release the FGF-
encoding DNA without the cII ribosome binding site and the ends were
repaired. This fragment was ligated into pET 12a, which had been BamH I digested
and treated to repair the ends. The resulting plasmid was designated PZ1D. PZ1D
includes DNA encoding the ompT secretion signal operatively linked to DNA encoding
the fusion protein.

E. coli strains BL21(DE3), BL21(DE3)pLysS, HMS174(DE3) and
HMS174(DE3)pLysS (Novagen, Madison WI) were transformed with PZ1D according
to manufacturer's instructions and the methods described in Example 2.A.2.

EXAMPLE 3

Preparation of Modified Saporin

Saporin was modified by addition of a cysteine residue at the N-
terminus-encoding portion of the DNA or by the addition of a cysteine at position 4 or
10. The resulting saporin is then reacted with an available cysteine on an FGF to
produce conjugates that are linked via the added Cys or Met-Cys on saporin.

Modified SAP has been prepared by altering the DNA encoding the SAP
by inserting DNA encoding Met-Cys at position -1 or by replacing the Ile or the Asn
codon within 10 or fewer residues of the N-terminus with Cys. The resulting DNA has
been inserted into pET 11a and pET 15b and expressed in BL21(DE3) cells. The
resulting saporin proteins are designated FPS1 (saporin with Cys at -1), FPS2 (saporin
with Cys at position 4) and FPS3 (saporin with Cys at position 10). A plasmid that
encodes FPS1 and that has been used for expression of FPS1 has been designated
PZ50B. Plasmids that encode FPS2 and that have been used for expression of FPS2
have been designated PZ51B (pET11a-based plasmid) and PZ51E (pET15b-based
plasmid). Plasmids that encode FPS3 and that have been used for expression of FPS3
have been designated PZ52B (pET11a-based plasmid) and PZ52E (pET15b-based
plasmid).

A. Materials and Methods

1. Bacterial strains

Novablue (Novagen, Madison, WI) and BL21(DE3) (Novagen, Madison,
WI).

2. DNA manipulations

DNA manipulations were performed as described in Examples 1 and 2.
Plasmid PZ1B (designated PZ1B1; the "1" at the end refers to the bacterial host strain,
BL21(DE3)) described in Example 2 was used as the DNA template.



B. Preparation of saporin with an added cysteine residue at the N-terminus

The DNA encoding SAP-6 was amplified by polymerase chain reaction
(PCR) from the parental plasmid pZ1B1 as described by McDonald et al. (1995). The
plasmid pZ1B1 contains the DNA sequence for human FGF-2 linked to SAP-6 by a
two amino acid linker (Ala-Met). pZ1B1 also includes the T7 promoter, lac operator,
ribosomal binding site, and T7 terminator present in the pET-11a vector. For SAP-6
DNA amplification, the 5' primer (5'-CATATGTGTGTCACATCAATCACATTAGAT-3')
(SEQ ID NO. 34) corresponding to the "sense" strand of SAP-6
incorporated a NdeI restriction enzyme site used for cloning. It also contained a Cys
codon at position -1 relative to the start site of the mature protein sequence. No leader
sequence was included. The 3' primer (5'-CAGGTTTGGATCCTTTACGTT-3') (SEQ
ID NO. 35), corresponding to the "antisense" strand of SAP-6, has a BamHI site used for
cloning. The amplified DNA was gel purified and digested with NdeI and BamHI. The
digested SAP-6 DNA fragment was subcloned into the NdeI and BamHI digested
pZ1B1. This digestion removed FGF-2 and the 5' portion of SAP-6 (up to nucleotide
position 650) from the parental rFGF2-SAP vector (pZ1B1) and replaced this portion
with a SAP-6 molecule containing a Cys at position -1 relative to the start site of the
native mature SAP-6 protein. The resultant plasmid was designated as pZ50B. pZ50B
was transformed into E. coli strain NovaBlue for restriction and sequencing analysis.
The appropriate clone was then transformed into E. coli strain BL21(DE3) for
expression and large scale production.

C. Preparation of saporin with a cysteine residue at position 4 or 10 of the
native protein

These constructs were designed to introduce a cysteine residue at
position 4 or 10 of the native protein by replacing the isoleucine residue at position 4 or
the asparagine residue at position 10 with cysteine.

SAP was amplified by polymerase chain reaction (PCR) from the
parental plasmid pZ1B encoding the FGF-SAP fusion protein using a primer
corresponding to the sense strand of saporin, spanning nucleotides 466-501 of SEQ ID
NO. 12, which incorporates a NdeI site and replaces the Ile codon with a Cys codon at
position 4 of the mature protein (SEQ ID NO. 69):

CATATGGTCACATCATGTACATTAGATCTAGTAAAT

or a primer corresponding to the sense strand of saporin, nucleotides 466-515 of SEQ
ID NO. 12, which incorporates a NdeI site and replaces the Asn codon with a Cys codon at
position 10 of the mature protein (SEQ ID NO. 70):

CATATGGTCACATCAATCACATTAGATCTAGTATGTCCGACCGCGGGTCA.

The 3' primer complements the coding sequence of saporin spanning nucleotides 547-
567 of SEQ ID NO. 12 and contains a BamHI site (SEQ ID NO. 35):

CAGGTTTGGATCCTTTACGTT.

The PCR amplification reactions were performed as described above,
using the following cycles: denaturation step 94°C for 1 min, annealing for 2 min at
60°C, and extension for 2 min at 72°C for 35 cycles. The amplified DNA was gel
purified, digested with NdeI and BamHI, and subcloned into NdeI and BamHI digested
pZ1B. This digestion removed the FGF and 5' portion of SAP (up to the BamHI site)
from the parental FGF-SAP vector (pZ1B) and replaced this portion with a SAP
molecule containing a Cys at position 4 or 10 relative to the start site of the native
mature SAP protein (see SEQ ID NOs. 29 and 30, respectively). The resulting
plasmids are designated pZ51B and pZ52B, respectively.
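
Translating the two sense primers confirms where each cysteine codon falls. The sketch
below builds the standard genetic code table and translates each primer from the codon
that follows the ATG contained in the NdeI site, so that residue 1 is the first residue
of mature saporin. The function and variable names are assumptions made for the
illustration.

```python
# Translate the cys+4 and cys+10 sense primers (SEQ ID NOs. 69 and 70).
# Residue 1 is taken as the codon immediately after the ATG in the NdeI site.

from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order.
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

SEQ_69 = "CATATGGTCACATCATGTACATTAGATCTAGTAAAT"
SEQ_70 = "CATATGGTCACATCAATCACATTAGATCTAGTATGTCCGACCGCGGGTCA"


def mature_residues(primer: str) -> str:
    """One-letter residues encoded after the first ATG; partial codons are dropped."""
    start = primer.index("ATG")
    coding = primer[start + 3:]
    usable = len(coding) - len(coding) % 3
    return "".join(CODON_TABLE[coding[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    for name, primer in (("SEQ ID NO. 69", SEQ_69), ("SEQ ID NO. 70", SEQ_70)):
        residues = mature_residues(primer)
        positions = [i + 1 for i, aa in enumerate(residues) if aa == "C"]
        print(name, residues, "Cys at position(s)", positions)
```

The cys+4 primer still encodes the unmodified residue at position 10, which translates
as asparagine (N), and the cys+10 primer still encodes isoleucine (I) at position 4.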

D. Cloning of DNA encoding SAP mutants in vector pET15b

1. The SAP-Cys-1 mutants

The initial step in this construction was the mutagenesis of the internal
BamHI site at nucleotides 555-560 (SEQ ID NO. 12) in pZ1B by PCR using a sense
primer corresponding to nucleotides 543-570 (SEQ ID NO. 12) but changing the G at
nucleotide 555 (the third position in the Lys codon) to an A. The complement of the
sense primer was used as the antisense primer (SEQ ID NO. 73). The first round of
amplification used primers SEQ ID NOs. 34 and 73 or 37 and 74, conducted as in B
above. Individual fragments were gel purified and a second round of amplification was
performed using primers of SEQ ID NOs. 34 and 74 as in B, above. This amplification
introduced a NdeI site and a Cys codon onto the 5' end of the saporin-encoding DNA.
The antisense primer was complementary to the 3' end of the saporin protein and
encoded a BamHI site for cloning and a stop codon (SEQ ID NO. 37):

GGATCCGCCTCGTTTGACTACTT.

The resulting fragment was digested with NdeI/BamHI and inserted into
pET15b (Novagen, Madison, WI), which has a His-Tag™ leader sequence (SEQ ID
NO. 36), that had also been digested with NdeI/BamHI.

2. The SAP-Cys+4 and SAP-Cys+10 mutants

This construction was performed similarly to the SAP-Cys-1 construction,
using pZ1B as the starting plasmid and splice overlap extension (SOE). It included
mutagenesis of the internal BamHI site at nucleotides 555-
560 (SEQ ID NO. 12) in pZ1B by PCR using a sense primer corresponding to
nucleotides 543-570 (SEQ ID NO. 12) but changing the G at nucleotide 555 (the third
position in the Lys codon) to an A, and introduction of the Cys at position 4 or 10 in
place of the native amino acid.

The first round of amplification used primers of SEQ ID NOs. 69 and 73
(for the cys+4 saporin mutants) or SEQ ID NOs. 70 and 73 (for the cys+10 saporin
mutants):

CATATGGTCACATCATGTACATTAGATCTAGTAAAT (SEQ ID NO. 69);
CATATGGTCACATCAATCACATTAGATCTAGTATGTCCGACCGCGGGTCA (SEQ ID NO. 70);
TTTCAGGTTTGGATCTTTTACGTTGTTT (SEQ ID NO. 73).
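
The single-base change that removes the internal BamHI site can be seen by aligning the
mutagenic primer (SEQ ID NO. 73) with the BamHI-containing 3' primer (SEQ ID NO. 35),
which covers the same region on the same strand. The sketch below is illustrative; the
coordinate mapping assumes, as stated in the text, that SEQ ID NO. 35 complements
nucleotides 547-567 of SEQ ID NO. 12, so its 5' end pairs with nucleotide 567. The
helper names are assumptions.

```python
# Compare the mutagenic primer (SEQ ID NO. 73) with the BamHI-containing
# primer (SEQ ID NO. 35). Both are written 5'->3' on the antisense strand.

BAMHI_SITE = "GGATCC"
SEQ_35 = "CAGGTTTGGATCCTTTACGTT"          # complements nts 547-567
SEQ_73 = "TTTCAGGTTTGGATCTTTTACGTTGTTT"   # complements nts 543-570, mutated

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def mismatches(a: str, b: str) -> list:
    """List (index, base_in_a, base_in_b) for each differing position."""
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


# Align on the 12-nt stretch shared by both primers.
offset = SEQ_73.find(SEQ_35[:12])
aligned_73 = SEQ_73[offset:offset + len(SEQ_35)]
diffs = mismatches(SEQ_35, aligned_73)

# Map the primer index back to a coding-strand coordinate in SEQ ID NO. 12:
# index 0 of SEQ_35 pairs with nt 567, index i with nt 567 - i.
changed_nt = 567 - diffs[0][0]
coding_change = (diffs[0][1].translate(_COMPLEMENT), diffs[0][2].translate(_COMPLEMENT))

if __name__ == "__main__":
    print("BamHI site in SEQ ID NO. 35:", BAMHI_SITE in SEQ_35)
    print("BamHI site in SEQ ID NO. 73:", BAMHI_SITE in SEQ_73)
    print("mismatches:", diffs)
    print("coding-strand change at nt", changed_nt, ":", "->".join(coding_change))
```

The one mismatch is a C to T change on the antisense strand, which corresponds to the G
to A change at nucleotide 555 on the coding strand described above.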

Amplification conditions were as follows: denaturation for 1 min at
94°C, annealing for 2 min at 70°C and extension for 2 min at 72°C for 35 cycles.
Individual fragments were gel purified and subjected to a second round of
amplification, following the same protocol, using only the external oligos of SEQ ID
NO. 37 and SEQ ID NO. 69 for the cys+4 mutant or SEQ ID NO. 70 for the cys+10
mutant. The resulting fragments had a NdeI site on the 5' end of the saporin-encoding
DNA and a BamHI site for cloning and a stop codon on the 3' end. The resulting
fragment was digested with NdeI/BamHI and inserted into pET15b (Novagen,
Madison, WI), which has a His-Tag™ leader sequence (SEQ ID NO. 36), that had
also been digested with NdeI/BamHI.

DNA encoding unmodified SAP (EXAMPLE 1) can be similarly
inserted into a pET15b or pET11a and expressed as described below for the modified
SAP-encoding DNA.

E. Expression of the modified saporin-encoding DNA

The E. coli cells containing Cys-1 SAP construct were grown in a high
cell density fed-batch fermentation with the temperature and pH controlled at 30°C and
6.9, respectively. A glycerol stock (1 mL) was grown in 50 ml of Luria Broth until
A600 reached 0.6. Inoculum (10 mL) was injected into a 7 L Applikon (Foster City,
CA) fermentor containing 2 L of complex batch media consisting of 5 g/L of glucose,
1.25 g/L, each, of yeast extract and tryptone (Difco Laboratories, Detroit, MI, U.S.A.),
7 g/L of K2HPO4, 8 g/L of KH2PO4, 1.66 g/L of (NH4)2SO4, 1 g/L of MgSO4·7H2O,
2 mL/L of a trace metal solution (74 g/L of Na3Citrate, 27 g/L of FeCl3·6H2O, 2.0 g/L
of CoCl2·6H2O, 2.0 g/L of Na2MoO4·2H2O, 1.9 g/L of CuSO4·5H2O, 1.6 g/L of
MnCl2·4H2O, 1.4 g/L of ZnCl2·4H2O, 1.0 g/L of CaCl2·2H2O, 0.5 g/L of H3BO3),
2 mL/L of a vitamin solution (6 g/L thiamine·HCl, 3.05 g/L of niacin, 2.7 g/L of
pantothenic acid, 0.7 g/L of pyridoxine·HCl, 0.21 g/L of riboflavin, 0.03 g/L of biotin,
0.02 g/L of folic acid), and 100 mg/L of carbenicillin. The culture was grown for
12 hours before initiating the continuous addition of a 40x solution of complex batch
media lacking the phosphates and containing only 25 mL/L, each, of trace metal and
vitamin solutions. The feed addition continued until the A600 of the culture reached 85,
at which time (approximately 9 h) the culture was induced with 0.1 mM IPTG. During
4 h of post-induction incubation, the culture was fed with a solution containing 100 g/L
of glucose, 100 g/L of yeast extract, and 200 g/L of tryptone. Finally, the cells were
harvested by centrifugation (8,000 x g, 10 min) and frozen at -80°C until further
processed.
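
Scaling the batch medium recipe to the 2 L working volume is a matter of multiplying
each concentration by the volume. The sketch below covers only the main batch
components listed per litre in the text; the dictionary and function names are
assumptions, and the trace metal and vitamin stock recipes are left out for brevity.

```python
# Scale the complex batch medium to a given working volume.
# Concentrations are taken from the text; names are illustrative.

BATCH_MEDIUM_G_PER_L = {
    "glucose": 5.0,
    "yeast extract": 1.25,
    "tryptone": 1.25,
    "K2HPO4": 7.0,
    "KH2PO4": 8.0,
    "(NH4)2SO4": 1.66,
    "MgSO4.7H2O": 1.0,
    "carbenicillin": 0.1,   # 100 mg/L
}

BATCH_MEDIUM_ML_PER_L = {
    "trace metal solution": 2.0,
    "vitamin solution": 2.0,
}


def scale(recipe: dict, volume_l: float) -> dict:
    """Return the amount of each component for the given volume in litres."""
    return {name: amount * volume_l for name, amount in recipe.items()}


if __name__ == "__main__":
    for name, grams in scale(BATCH_MEDIUM_G_PER_L, 2.0).items():
        print(f"{name}: {grams:g} g")
    for name, ml in scale(BATCH_MEDIUM_ML_PER_L, 2.0).items():
        print(f"{name}: {ml:g} mL")
```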

F. Purification and conjugation of modified saporin 

The cell pellet (~400 g wet weight) containing Cys-1 SAP was
resuspended in 3 volumes of buffer B (10 mM sodium phosphate, pH 7.0, 5 mM
EGTA, and 1 mM DTT). The suspension was passed through a microfluidizer three
times at 18,000 lb/in² on ice. The resultant lysate was diluted with NanoPure H2O until
conductivity fell below 2.7 mS/cm. All subsequent procedures were performed at room
temperature.

The dilute lysate was loaded onto an expanded bed of Streamline SP
cation-exchange resin (300 ml) pre-equilibrated with buffer C (20 mM sodium
phosphate, pH 7.0, 1 mM EDTA) at 100 mL/min upwards flow. The resin was washed
with buffer C until it appeared clear. The plunger was then lowered at 2 cm/min while
washing continued at 70 mL/min. Upwards flow was stopped when the plunger was
approximately 8 cm away from the bed, and the plunger was allowed to move to within
0.5 cm of the packed bed. The resin was further washed at 70 mL/min downwards flow
until A280 reached baseline. Buffer C plus 0.25 M NaCl was then used to elute proteins
containing Cys-1 SAP at the same flow rate.

The eluate was buffer exchanged into buffer D (50 mM sodium borate,
pH 8.5, 1 mM EDTA) using the Sartocon Mini crossflow filtration system with a
10,000 NMWCO module (Sartorius, Goettingen, Germany). The sample was then
applied to a column of Source 15S (30 mL) pre-equilibrated with buffer D. A 10
column volume linear gradient of 0 to 0.3 M NaCl in buffer D was used to elute Cys-1
SAP at 30 mL/min.

Both Cys-1 SAP and C96S FGF-2 were reduced with a final
concentration of 10 mM DTT prior to gel filtration with buffer E (0.1 M sodium
phosphate, pH 7.5, 0.1 M NaCl, 1 mM EDTA). The Cys-1 SAP was then reacted with
80-fold molar excess of DTNB at room temperature for 1 h, and the amount of Cys-1
SAP-TNB was determined by measuring absorbance at 412 nm using the molar
absorption coefficient of 14,150 M⁻¹cm⁻¹. The Cys-1 SAP/DTNB mixture was
subjected to size exclusion chromatography and eluted with buffer E. The C96S FGF-2
was added to the DTNB-treated Cys-1 SAP in a molar ratio of 3:1, and the reaction was
carried out at 4°C overnight.
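The absorbance reading at 412 nm can be converted to a molar concentration with the Beer-Lambert law. The following is a minimal sketch only, assuming a 1 cm path length by default; the function names and the optional dilution factor are illustrative and do not appear in the procedure above.

```python
# Beer-Lambert conversion of an A412 reading to a molar concentration.
# The coefficient is the value quoted in the text; everything else
# (function names, default path length, dilution factor) is an assumption.
TNB_EXTINCTION_M_CM = 14150.0  # M^-1 cm^-1 at 412 nm


def tnb_molar_concentration(a412, path_cm=1.0, dilution=1.0):
    """Return the concentration (mol/L) implied by an absorbance at 412 nm."""
    if a412 < 0 or path_cm <= 0 or dilution <= 0:
        raise ValueError("absorbance must be >= 0; path and dilution must be > 0")
    return a412 * dilution / (TNB_EXTINCTION_M_CM * path_cm)


def partner_moles(sap_moles, molar_ratio=3.0):
    """Moles of the partner protein needed for the stated 3:1 molar ratio."""
    return sap_moles * molar_ratio


if __name__ == "__main__":
    conc = tnb_molar_concentration(0.1415)
    print(f"{conc * 1e6:.2f} micromolar")
```

For instance, a reading of 0.1415 in a 1 cm cuvette corresponds to about 10 micromolar under these assumptions.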

The reaction mixture was loaded onto a column of Heparin-Sepharose
CL-4B pre-equilibrated with 0.5 M NaCl in buffer F (10 mM sodium phosphate, pH
6.0, 1 mM EDTA). The column was washed with 0.5 M then 1 M NaCl in buffer F,
and the conjugate eluted with 2 M NaCl in buffer F. Fractions containing FGF2-Cys-1
SAP were combined, concentrated, and applied to a column of Superdex 75. Buffer G
(10 mM sodium phosphate, pH 6.0, 0.15 M NaCl, 0.1 mM EDTA) was used for the
Superdex 75 column.

During Cys-1 SAP purification, SDS-PAGE was performed on 12%
acrylamide Mini-PROTEAN II Ready Gels (Bio-Rad, Hercules, CA, U.S.A.) according
to the method of Laemmli (1970) under non-reducing conditions. PhastSystem using
10-15% acrylamide gradient gels.






EXAMPLE 4 

Production of VEGF, VEGF-SAP and SAP-VEGF Constructs 



A. General Descriptions 
1. Bacterial Strains and Plasmids:

E. coli strains BL21(DE3), BL21(DE3)pLysS, HMS174(DE3) and
HMS174(DE3)pLysS were purchased from Novagen, Madison, WI;

2. DNA Manipulations

Native SAP and rabbit polyclonal antiserum to SAP were obtained as
described above or as described in Lappi et al. (1985) Biochem. Biophys. Res. Comm.
129:934-942 and Lappi et al. (1989) Biochem. Biophys. Res. Comm. 160:917-923.
The pET System Induction Control was purchased from Novagen, Madison, WI. The
sequencing of the different constructions was done using the Sequenase kit of United
States Biochemical Corporation (version 2.0). Minipreparation and maxipreparations
of plasmids, preparation of competent cells, transformation, M13 manipulation,
bacterial media and Western blotting were performed using routine methods (see, e.g.,
Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, NY). The purification of DNA fragments was
done using the Geneclean II kit, purchased from Bio 101. SDS gel electrophoresis was
performed on a PhastSystem (Pharmacia).

3. Materials: 

Bacterial strains: Novablue and BL21(DE3) (Novagen, Madison, WI). 
Constructs that have been prepared include: 

VEGF165-SAP containing the VEGF leader sequence (amino
acids 1-26; see, e.g., SEQ ID NO. 26);
VEGF165-SAP without the leader sequence;
SAP:VEGF165 containing the VEGF leader sequence;
SAP:VEGF165 without the leader sequence;
VEGF165 containing the leader sequence; and
VEGF165 without the leader sequence; and similar constructs
with VEGF121 (see also Table 3).

Constructs containing any of VEGF121, VEGF189 and VEGF206 in
place of VEGF165 are prepared in a similar manner except that DNA encoding
VEGF121 (SEQ ID NO. 25), VEGF189 (SEQ ID NO. 27) and VEGF206 (SEQ ID NO.
28) is used in place of the VEGF165-encoding DNA in the above constructs, and,
where necessary, appropriate amplification primers are selected.

VEGF-encoding DNA was obtained from plasmids designated
pUC-121, pUC-165 and pUC-189 (the plasmids were the gift of Judith Abraham). Each of
these plasmids had been prepared by inserting the respective DNA clone containing
each form of VEGF linked to the signal peptide (see, SEQ ID NO. 26, nucleotides 13-90)
into the BamHI site of the well known and commercially available vector pUC18
(for descriptions of this vector, see, e.g., U.S. Patent Nos. 5,114,840, 4,992,051,
4,968,613, 4,898,828; see, also Yanisch-Perron et al. (1985) Gene 33:103-119;
Norrander et al. (1983) Gene 26:101-106; available from, for example, Life
Technologies, Inc., Rockville, MD).



B. Construction of plasmids encoding VEGF-SAP fusion proteins

1. Construction of plasmids that contain DNA encoding VEGF165

a. Cloning

(i) VEGF165-SAP constructs

The VEGF165-SAP constructs were prepared using the parental
FGF:SAP vector PZ1B (pET11a-based vector; see Example 2) that had been digested
with NdeI and NcoI in order to remove the FGF-encoding portion of the DNA encoding
the fusion protein, but leave the SAP-encoding portion intact. The FGF-encoding
region was then replaced with the VEGF165-encoding DNA that has a NdeI site at the
5' end and a NcoI site at the 3' end.

(ii) VEGF165 constructs

To express the VEGF165 protein without saporin, the pET11a vector
was digested with NdeI and BamHI and the VEGF sequence inserted with the
appropriate ends. For the constructs containing VEGF alone, the 3' primer contained a
BamHI site for cloning and encodes a stop codon as follows:

5' GGATCCTCATCACCGCCTCGGCTT 3' (SEQ ID NO. 64).



b. Amplification

(i) VEGF165-SAP constructs with the leader sequence

The constructs with the VEGF leader sequence were prepared from
VEGF165 as the template. For plasmids containing VEGF with the leader sequence or
VEGF-SAP containing the leader sequence of the VEGF, the 5' primer contained a
NdeI restriction enzyme site and encodes the signal sequence as follows:

5' CATATGAACTTTCTGCTGTCTTGG 3' (SEQ ID NO. 62).

The NcoI site within VEGF165 was removed by SOE as the first step in the
preparation of this construct using oligonucleotides of SEQ ID NOs. 60 and 61.
The first round of PCR used oligos of SEQ ID NOs. 62 and 61 and SEQ ID NOs.
60 and 65:

5' TCCCAGGCTGCACCAATGGCAGAAGGAGGA 3' (SEQ ID NO. 60; sense
primer); 5' TCCTCCTTCTGCCATTGGTGCAGCCTGGGA 3' (SEQ ID NO. 61; the
complement or "antisense" primer); and 5' CCATGGCCGCCTCGGCTTGTC 3' (SEQ
ID NO. 65; 3' primer that removes the stop codons and introduces a NcoI site).



Amplification was performed as follows: denaturation for 1 min at 94°C,
annealing for 2 min at 70°C and extension for 2 min at 72°C for 35 cycles. Individual
fragments were gel purified and subjected to a second round of amplification using only
the external oligos (SEQ ID NOs. 62 and 65) under the same amplification conditions
as above, to generate full length fragments which contain the appropriate cloning sites
at the ends. After amplification and purification, the inserts are directionally cloned
into NdeI/NcoI-digested PZ1B.
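The primer design described for this SOE step can be sanity-checked in a few lines. The sketch below is illustrative only: it uses the oligonucleotide sequences of SEQ ID NOs. 60, 61, 62 and 65 as given in the text, the standard recognition sequences for NdeI, NcoI and BamHI, and helper names that are assumptions made for this example.

```python
# Check that the mutagenic primers are exact reverse complements and that
# each primer carries (or lacks) the expected restriction site.
SITES = {"NdeI": "CATATG", "NcoI": "CCATGG", "BamHI": "GGATCC"}

PRIMERS = {
    60: "TCCCAGGCTGCACCAATGGCAGAAGGAGGA",  # sense mutagenic primer
    61: "TCCTCCTTCTGCCATTGGTGCAGCCTGGGA",  # antisense mutagenic primer
    62: "CATATGAACTTTCTGCTGTCTTGG",        # 5' primer with leader sequence
    65: "CCATGGCCGCCTCGGCTTGTC",           # 3' primer for the SAP fusion
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def sites_in(seq):
    """Return the sorted names of the listed restriction sites found in seq."""
    return sorted(name for name, site in SITES.items() if site in seq)


if __name__ == "__main__":
    print("60/61 complementary:", reverse_complement(PRIMERS[60]) == PRIMERS[61])
    for number, seq in PRIMERS.items():
        print(number, sites_in(seq))
```

Under these assumptions the two internal primers overlap completely and contain no NcoI site, while the external primers carry the NdeI and NcoI sites used for directional cloning.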

(ii) VEGF165-SAP constructs without the leader sequence

To prepare the constructs that lack DNA encoding the leader sequence, a
similar amplification was performed with a primer similar to SEQ ID NO. 62, but
lacking the signal sequence and beginning at position 1 of the mature protein as
follows:

5' CATATGGCACCAATGGCAGAAGGAGGAGG 3' (SEQ ID NO. 63).

2. Construction of plasmids that contain DNA encoding VEGF121

The same type of constructs are generated, except that DNA encoding
VEGF121 (see, SEQ ID NO. 25) is used in place of the DNA encoding
VEGF165. The initial step requires the
mutagenesis of an internal NcoI site located at position 95 in the mature VEGF protein
by nucleic acid amplification using the VEGF121-encoding DNA as a source of DNA.

The "sense" oligo primer has the sequence:
5' TCCCAGGCTGCACCAATGGCAGAAGGAGGA 3' (SEQ ID NO. 60);
and the complement or "antisense" primer has the sequence:
5' TCCTCCTTCTGCCATTGGTGCAGCCTGGGA 3' (SEQ ID NO. 61) as described
for VEGF165.

For the VEGF121 or VEGF121-SAP constructs that contain the leader
sequence the 5' primer has the following sequence:

5' CATATGAACTTTCTGCTGTCTTGG 3' (SEQ ID NO. 62).

This primer contains a NdeI restriction enzyme site and encodes the signal sequence. A
second amplification is performed with a similar primer that lacks the signal sequence
and begins at position 1 of the mature protein. This primer has the following sequence:
5' CATATGGCACCAATGGCAGAAGGAGGAGG 3' (SEQ ID NO. 63).

For the constructs containing VEGF alone, the 3' primer contains a
BamHI site for cloning and encodes a stop codon. This primer has the following
sequence:

5' GGATCCTCATCACCGCCTCGGCTT 3' (SEQ ID NO. 64).

For the constructs designed to express the VEGF-SAP fusion protein, the stop codons
have been removed and a NcoI site is introduced onto the 3' primer that has the
sequence:

5' CCATGGCCGCCTCGGCTTGTC 3' (SEQ ID NO. 65).

The amplification conditions followed the above protocol. 

C. Construction of plasmids encoding SAP-VEGF fusion proteins 

The following constructs have been prepared: 

1) SAP-VEGF121
2) SAP-VEGF165
3) SAP-linker-VEGF121
4) SAP-linker-VEGF165
5) SAP-linker-VEGF121-linker-VEGF121
6) SAP-linker-VEGF165-linker-VEGF165

in which the linker is (Gly4Ser)n, where n is selected from 1, 2 or 4. DNA encoding
any other suitable peptide linker, see, e.g., SEQ ID NOs. 38-50, can be substituted for
the exemplified linkers. For other constructs, see Table 3.

Constructs 1-4 serve as cloning intermediates for the final forms 5 and 6.
All forms have been completely characterized.

All cloning was performed using the vector pET-SAPMCS. The starting
material for this vector can be PZ1A or any of the pET11a-based vectors herein.
Unmodified saporin can be cloned, using PCR amplification with appropriate primers,
into the NdeI and NcoI sites of the pET11a-based vector. Using appropriate primers, an
EcoRI site is added 5' to and adjacent to the NdeI site, and an EcoRI site is added 3' to
and adjacent to the NcoI site. The resulting amplified fragment is digested with EcoRI
and subcloned into the EcoRI site of plasmid pGEM-4 (pGEM-4 serves as the source of
the MCS; the pGEM series of plasmids are available from Promega, Madison, WI; see
also, U.S. Patent No. 4,766,072, which describes construction of the pGEM plasmids)
in an orientation such that the multicloning site (MCS) of pGEM-4 is 3' of the
saporin-encoding sequences. In such constructs, the resulting plasmid (pGEMSAP) was
digested with PstI and the ends of the fragment were blunt-ended. The fragment was
then digested with NdeI, thereby generating a fragment that contains all of the
saporin-encoding DNA and most of the MCS of pGEM-4. This fragment was then cloned into
the NdeI/BamHI sites of pET11a, in which the BamHI site had been blunt-ended by
filling in with Klenow polymerase and then cut with NdeI to produce NdeI/BamHI
blunt ends. The resulting plasmid was designated PETSAP-MCS. It has unique SacI,
SmaI, and SalI sites in the MCS for insertion of DNA encoding a desired linker, VEGF
monomer, or combination of VEGF and linker 3' of the saporin-encoding DNA.

1. SAP-VEGF121 and SAP-VEGF165 

The VEGF-encoding DNA is cloned downstream from SAP using the
NcoI site at the C-terminus of SAP and one of several enzyme sites contained in the
flanking region. For these constructs the VEGF molecule was amplified from cDNA
using oligos that introduce a NcoI (CCATGG) site onto the N-terminus of the mature
protein (and also remove an internal NcoI), and introduce a stop codon at the
C-terminus of VEGF as well as a SalI (GTCGAC) site for cloning. In each case, the
appropriate parental vector was digested with NcoI and SalI, and a NcoI/SalI digested
insert was cloned into that site. PCR conditions were as for the amplification reactions
described above.

Amplification was done using the following 5' sense oligo:
5' CCATGGCACCAATGGCAGAAGGAGGA 3' (SEQ ID NO. 51),
and 3' anti-sense oligo:
5' GTCGACTCATCACCGCCTCGGCTT 3' (SEQ ID NO. 52).

2. SAP-linker-VEGF121 and SAP-linker-VEGF165

For the generation of the linker-VEGF constructs, a different 5' primer
that adds a NcoI site to the N-terminus, mutates the internal NcoI site and adds either
the DNA encoding (Gly4Ser) (SEQ ID NO. 40), designated X1, or (Gly4Ser)2
designated X2 (SEQ ID NO. 41) onto the N-terminus of the VEGF molecule was used.
The 3' primer was the above oligonucleotide (SEQ ID NO. 52). For the constructs with
the (Gly4Ser)-encoding DNA, the 5' primer oligo was:

5' CCATGGGCGGCGGCGGCTCTGCACCAATGGCAGAAGGA 3' (SEQ ID NO. 53).

For the (Gly4Ser)2 linker, this oligo was:

5' CCATGGGCGGCGGCGGCTCTGGCGGCGGCGGCTCTGCACCAATGGCAGAAGGA 3'
(SEQ ID NO. 54).

The sequence of SAP-(Gly4Ser)-VEGF121 is set forth in SEQ ID
NO. 57. The sequence of SAP-(Gly4Ser)-VEGF165 is set forth in SEQ ID NO. 58.
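The two linker primers differ only in the number of Gly4Ser-encoding repeats placed between the NcoI site and the start of the mature VEGF sequence. As an illustration, the sketch below rebuilds both oligos from their parts and compares them with the sequences given for SEQ ID NOs. 53 and 54. The decomposition into three parts and all names are assumptions made for this example, read off the printed oligos.

```python
# Assemble the 5' linker primers from three parts and compare with the
# sequences printed in the text. The split into parts is an interpretation
# of the printed oligos, not a statement of how they were designed.
NCOI_START = "CCATG"                 # first five bases of the NcoI site
GLY4SER_DNA = "GGCGGCGGCGGCTCT"      # GGC x4 (Gly) + TCT (Ser)
VEGF_5PRIME = "GCACCAATGGCAGAAGGA"   # start of mature VEGF, internal NcoI mutated

SEQ_ID_53 = "CCATGGGCGGCGGCGGCTCTGCACCAATGGCAGAAGGA"
SEQ_ID_54 = "CCATGGGCGGCGGCGGCTCTGGCGGCGGCGGCTCTGCACCAATGGCAGAAGGA"


def linker_primer(repeats):
    """Return the 5' primer carrying `repeats` copies of the linker codons."""
    if repeats < 1:
        raise ValueError("at least one linker repeat is required")
    return NCOI_START + GLY4SER_DNA * repeats + VEGF_5PRIME


if __name__ == "__main__":
    print(linker_primer(1) == SEQ_ID_53)
    print(linker_primer(2) == SEQ_ID_54)
```

Because the first G of the glycine codon completes the CCATGG site, each assembled primer still begins with an intact NcoI site.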

The construct in which the linker is (Gly4Ser)4 was prepared by
digesting a plasmid (designated PZ74B or PZ74F) which contains SAP-AlaMet-VEGF165
construct with NcoI and inserting a fragment encoding NcoI-(Gly4Ser)4-NcoI
(prepared, for example, by inserting codons encoding (Gly4Ser)2 between the
Gly4Ser and Gly4Ser in SEQ ID NO. 41).

3. SAP-linker-VEGF165-linker-VEGF165 and SAP-linker-VEGF121-linker-VEGF121
constructs

For construction of the SAP-linker-VEGF-linker-VEGF constructs, the
same 5' oligos were used for the constructs incorporating (Gly4Ser)1 and (Gly4Ser)2
(see, SEQ ID NOs. 53 and 54, respectively) and a set of 3' oligos were prepared that
incorporated (Gly4Ser)1 or (Gly4Ser)2 and a NcoI site. The SAP-AlaMet-VEGF
parental construct was digested with NcoI and the NcoI-linker-VEGF-NcoI fragment
was inserted to produce constructs containing SAP-linker-VEGF-linker-VEGF.
The C-terminus oligo for the (Gly4Ser)1 linker was:

5' CCATGGCAGAGCCGCCGCCGCCCCGCCTCGGCTTGTCACAT 3' (SEQ ID NO. 55).

The (Gly4Ser)2 linker oligo for the 3' portion was (SEQ ID NO. 56):

5' CCATGGCAGAGCCGCCGCCGCCAGAGCCGCCGCCGCCCCGCCTCGGCTTGTCACAT 3'.

Amplification conditions were as described above.

The sequence of SAP-(Gly4Ser)-VEGF121-(Gly4Ser)-VEGF121 is set
forth in SEQ ID NO. 78 and the sequence of SAP-(Gly4Ser)-VEGF165-(Gly4Ser)-VEGF165
is set forth in SEQ ID NO. 79.

4. SAP-Linker-VEGF121 constructs

These constructs were prepared in a similar manner to the VEGF165-containing
constructs, except that plasmids containing VEGF121 were used as the
starting materials.

D. Expression of the SAP-VEGF and SAP-LINKER-VEGF constructs

The plasmids containing the various SAP-VEGF and SAP-LINKER-VEGF
constructs (see Table 3) have been introduced into various vectors and hosts and
are cultured under conditions suitable for expression in the selected host/vector. The
resulting fusion protein is then purified as described for VEGF using heparin sulfate
(see, e.g., U.S. Patent No. 5,219,739 to Tischer et al.; U.S. Patent No. 5,194,596 to
Tischer et al.; U.S. Patent No. 5,240,848 to Keck et al.; International PCT Application
No. WO 90/13649, which is based on U.S. applications Serial Nos. 07/351,361,
07/369,424, 07/389,722, to Genentech, Inc., and any U.S. Patent based on U.S.
applications Serial Nos. 07/351,361, 07/369,424, 07/389,722; European Patent
Applications EP 0 506 477 A1 and EP 0 476 983 A1 to MERCK & CO.; Houck et al.
(1991) Mol. Endo. 5:1806-1814). An affinity column with anti-SAP antibody may
alternatively be used to purify VEGF conjugates, especially for SAP-VEGF121.

E. Cytotoxicity of VEGF fusion protein conjugates

Cytotoxicity experiments are performed with the Promega (Madison,
WI) CellTiter 96 Cell Proliferation/Cytotoxicity Assay. About 1,500 bovine or human
aortic endothelial cells or other vascular endothelial cells are plated per well in a 96
well plate in 90 µl HDMEM plus 10% FCS and incubated overnight at 37°C, 5% CO2.
The following morning 10 µl of media alone or 10 µl of media containing various
concentrations of the fusion protein, VEGF dimer or saporin are added to the wells.
The plate was incubated for 72 hours at 37°C. Following the incubation period, the
number of living cells is determined by measuring the incorporation and conversion of
the commonly available dye MTT supplied as a part of the Promega kit. Fifteen µl of
the MTT solution was added to each well, and incubation was continued for 4 hours.
Next, 100 µl of the standard solubilization solution supplied as a part of the Promega
kit are added to each well. The plate is allowed to stand overnight at room temperature
and the absorbance at 560 nm was read on an ELISA plate reader (Titertek Multiskan
PLUS, ICN, Flow, Costa Mesa, CA).
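The plate-reader output from such an assay is commonly expressed as a percentage of the media-only control wells. The following is a minimal sketch of that arithmetic, not part of the protocol above; the function name, the optional blank subtraction and the example readings are all assumptions made for illustration.

```python
# Express A560 readings from treated wells as a percentage of the
# media-only control wells. Blank subtraction is optional and is an
# assumption of this sketch, not a step stated in the text.
from statistics import mean


def percent_of_control(treated, control, blank=0.0):
    """Mean treated absorbance as a percentage of mean control absorbance."""
    if not treated or not control:
        raise ValueError("both well lists must be non-empty")
    control_signal = mean(control) - blank
    if control_signal <= 0:
        raise ValueError("control signal must exceed the blank")
    return 100.0 * (mean(treated) - blank) / control_signal


if __name__ == "__main__":
    # Hypothetical replicate readings, for illustration only.
    media_only = [1.00, 1.04, 0.96]
    with_conjugate = [0.52, 0.48, 0.50]
    print(round(percent_of_control(with_conjugate, media_only), 1))
```

Plotting this percentage against the logarithm of the protein concentration gives the usual dose-response curve for comparing the fusion protein, the VEGF dimer and saporin.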

EXAMPLE 5 
Baculovirus Expression of VEGF 

A. Materials:

The VEGF constructs, including the VEGF121, VEGF121:SAP,
VEGF165 and VEGF165:SAP constructs containing the leader sequences are
introduced into a baculovirus vector pBlueBacIII (Invitrogen, San Diego, CA) and
then co-transfected with wild type virus into Spodoptera frugiperda (Sf9) insect cells
(see, e.g., Luckow et al. (1988) Bio/technology 6:47-55 and U.S. Patent No. 4,745,051).

Antisera to VEGF was obtained from R&D/Peprotech (polyclonal anti-native
VEGF) and Santa Cruz (polyclonal anti-VEGF peptide antibody).
Constructs that are prepared include: VEGF121 containing the leader
sequence and VEGF165 containing the leader sequence. The fusion proteins in which
saporin is linked to the N-terminus of a VEGF monomer are presently preferred for
baculovirus expression (and also bacterial expression). Heterologous leader sequences,
discussed below, that direct secretion of the encoded fusion protein are added.

B. Amplification

The template for these constructs is the VEGF121 or VEGF165 or the
VEGF165:SAP construct containing the leader sequence in pET11a, described above.
The 5' oligo (sense) for the VEGF121, VEGF121:SAP and VEGF165 constructs
contains a BamHI site for cloning into the vector and is as follows:

5' GGATCCGAAACATGAACTTTCTGCTGTCT 3' (SEQ ID NO. 66).

The VEGF165:SAP construct is amplified from the existing
VEGF165:SAP insert in pET11a using the following 5' oligo, which contains a BamHI
site for cloning and is:

5' GGATCCGAAACATATGAACTTTCTGCTGTCT 3' (SEQ ID NO. 67).

The 3' or non-coding oligo for the VEGF121:SAP or VEGF165:SAP constructs
contains a PstI site for cloning into the vector and has the sequence:

5' CTGCAGGCCTCGTTTGACTACTT 3' (SEQ ID NO. 71).

The oligo for the 3' end of the VEGF121 and VEGF165 has the
sequence (SEQ ID NO. 68): 5' CTGCAGTCATCACCGCCTCGGCTT 3'.
Amplification follows the same protocol as described in the above Examples:
denaturation for 1 min at 94°C, annealing for 2 min at 70°C, and extension for 2 min at
72°C for 35 cycles.
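The cycling conditions can be written down as a small data structure, which makes it easy to check the total hold time of a run. This is an illustrative sketch only; the class and function names are assumptions, and ramp times between temperatures are ignored.

```python
# Represent the three-step cycling program from the text and compute the
# total hold time. Ramp times between steps are not modelled.
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    minutes: float
    celsius: float


PROGRAM = (
    Step("denaturation", 1, 94),
    Step("annealing", 2, 70),
    Step("extension", 2, 72),
)
CYCLES = 35


def total_minutes(steps, cycles):
    """Total hold time in minutes for `cycles` repeats of `steps`."""
    if cycles < 1:
        raise ValueError("cycles must be at least 1")
    return cycles * sum(step.minutes for step in steps)


if __name__ == "__main__":
    minutes = total_minutes(PROGRAM, CYCLES)
    print(f"{minutes} min ({minutes / 60:.2f} h) excluding ramps")
```

With 5 minutes of holds per cycle, 35 cycles amount to 175 minutes, just under three hours before ramping is taken into account.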

C. Cloning

The inserts are directionally cloned into the BamHI/PstI sites of
pBlueBacIII.

D. Preparation of VEGF molecules with an accessible cysteine residue at the
N-terminus for chemical conjugation

VEGF molecules with an accessible cysteine residue at the N-terminus
are constructed. These molecules can be chemically conjugated to one of the SAP
muteins (Cys -1, +4 or +10 as described above). These constructs are as follows:

1) VEGF121 with a cys at +4
2) VEGF165 with a cys at +4
3) VEGF121 with a cys at +2 followed by a NcoI site which makes
this construct linker amenable.
4) VEGF165 with a cys at +2 followed by a NcoI site for the linker
amenable form.

These constructs are designed such that the distance between the
molecules (or accessible cysteines) can be increased by adding various linkers encoded
on a NcoI (CCATGG) fragment, and thereby decrease any steric hindrance. The
presently preferred linkers are the (Gly4Ser)n linkers, in which n=1-4
(see, SEQ ID NOs. 40 and 41, for n=1 and 2).

The template for amplification of these constructs is VEGF121- or
VEGF165-encoding DNA (see, SEQ ID NOs. 25 and 26, respectively). The 5' sense
oligo for the introduction/insertion of the mutations into the mature forms of the
proteins for the VEGF121/165 cys+4 constructs, (1) and (2) above, is:

5' TGGTCCCAGGCTGCACCCATGTGTGAAGGAGGAGGGCAGAATCAT 3'
(SEQ ID NO. 80).

The corresponding anti-sense mutational oligo is:

5' ATGATTCTGCCCTCCTCCTTCACACATGGGTGCAGCCTGGGACCA 3'
(SEQ ID NO. 81).

The 5' sense oligo for the introduction/insertion of the cys mutations into
the mature forms of the proteins for the VEGF121/165 cys+2 NcoI constructs,
(3) and (4) above, is:

5' GCCAAGTGGTCCCAGGCTGCATGTCCCATGGCAGAAGGAGGAGGGCAG 3'
(SEQ ID NO. 82).

The corresponding anti-sense mutational oligo is:

5' CTGCCCTCCTCCTTCTGCCATGGGACATGCAGCCTGGGACCACTTGGC 3'
(SEQ ID NO. 83).

The 5' sense oligo containing the BamHI (GGATCC) cloning site for introduction into
the baculovirus transfer vector pBlueBacIII for each of the above forms, (1)-(4) above,
is:

5' GGATCCGAAACATGAACTTTCTGCTGTCT 3' (SEQ ID NO. 66).

The 3' anti-sense oligo containing the PstI (CTGCAG) site for cloning into the
pBlueBacIII transfer vector for each of the above constructs is:

5' CTGCAGTCATCACCGCCTCGGCTT 3' (SEQ ID NO. 68).

The constructs are prepared by splicing by overlap extension (SOE) by
amplification of two pieces of the protein, which are then put together by the SOE
technique. For example, to generate the VEGF cys+4 forms the first round of
amplification uses oligos of SEQ ID NOs. 81 and 66 and SEQ ID NOs. 80 and 68. For
the VEGF cys+2 NcoI constructs the first round of PCR would use oligos of SEQ ID
NOs. 66 and 83 and 82 and 68. After amplification as follows: denaturation for 1 min
at 94°C, annealing for 2 min at 70°C and extension for 2 min at 72°C, individual
fragments are gel purified and subjected to a second round of amplification using only
the external oligos (SEQ ID NOs. 66 and 68), under the same amplification conditions
as above, to generate full length fragments which contain the appropriate cloning sites
at the ends. After amplification and purification, the inserts are directionally cloned
into the BamHI and PstI sites of the pBlueBacIII transfer vector.
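Translating the sense mutational oligos shows where each one places the new cysteine. The sketch below is illustrative and rests on two assumptions that are not stated in the text: that each oligo is in frame from its first base, and that the mature protein begins immediately after the signal-peptide residues Trp-Ser-Gln-Ala encoded at the start of the oligo. The helper names are likewise hypothetical.

```python
# Translate the sense mutational oligos (SEQ ID NOs. 80 and 82) and report
# the position of the cysteine relative to the start of the mature protein.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

SEQ_ID_80 = "TGGTCCCAGGCTGCACCCATGTGTGAAGGAGGAGGGCAGAATCAT"
SEQ_ID_82 = "GCCAAGTGGTCCCAGGCTGCATGTCCCATGGCAGAAGGAGGAGGGCAG"
SIGNAL_TAIL = "WSQA"  # assumed last residues of the leader sequence


def translate(dna):
    """Translate DNA in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def cys_position(oligo):
    """1-based position of the first Cys in the mature part of the oligo."""
    mature = translate(oligo).split(SIGNAL_TAIL, 1)[1]
    return mature.index("C") + 1


if __name__ == "__main__":
    for label, oligo in (("SEQ ID NO. 80", SEQ_ID_80), ("SEQ ID NO. 82", SEQ_ID_82)):
        print(label, translate(oligo), "Cys at +%d" % cys_position(oligo))
```

Under these assumptions the first oligo places the cysteine at +4 and the second at +2, with the second also retaining a CCATGG site directly after the cysteine codon.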

The constructs and corresponding Sequence Listing ID Nos. are as
follows:

1) VEGF121 cys +4 is set forth in SEQ ID NO. 86;
2) VEGF165 cys +4 is set forth in SEQ ID NO. 87;
3) VEGF121 Cys+2 with NcoI sites is set forth in SEQ ID NO. 88;
and
4) VEGF165 Cys+2 with NcoI is set forth in SEQ ID NO. 89.



E. Preparation of SAP:VEGF constructs with heterologous signal (leader)
sequences

Constructs containing a heterologous signal sequence in place of the
VEGF signal sequence (see, e.g., amino acids 1-26 in SEQ ID NO. 33) or in addition to
it are prepared. Such constructs are prepared using vectors such as pPBac and pMBac
(available from Stratagene, San Diego, CA; see, also Lernhardt et al. (1993) Strategies
6:20-21), which contain the human alkaline phosphatase (see, e.g., Bailey et al. (1989)
Proc. Natl. Acad. Sci. U.S.A. 86:22-26) and melittin (see, e.g., Tessier et al. (1991)
Gene 98:177-183) secretory signals inserted into the BamHI and NdeI sites,
respectively, of pJVP10Z (see, e.g., Kawamoto et al. (1991) Biochem. Biophys. Res.
Commun. 181:756-63; Ueda et al. (1994) Gene 140:267-272). Insertion of genes into the
SmaI/BamHI sites of these vectors results in fusion proteins that are directed into the
insect cell secretory pathway, which processes the pro-polypeptide so that mature
peptide or fusion protein is secreted into the growth medium.

Other heterologous signal sequences, such as the insulin signal sequence
(see, e.g., U.S. Patent No. 4,431,746 for DNA encoding the signal sequence), the
growth hormone signal sequence, mammalian alkaline phosphatase, the melittin signal
sequence and others that are processed by insect cells are used.

The heterologous signal sequences are used in other constructs as well,
including VEGF:SAP constructs, in order to direct the proteins encoded by operatively
linked DNA into the periplasmic space or growth medium.

F. Expression of the VEGF fusion protein-encoding constructs

The plasmids containing the various SAP-VEGF and SAP-linker-VEGF
constructs (see Table 3) have been introduced into the baculovirus host and are cultured
under conditions suitable for expression in the selected host/vector. The resulting
fusion proteins are then purified.

Characterization of the VEGF-SAP fusion protein

1. Western blot of affinity-purified VEGF-SAP fusion protein

SDS gel electrophoresis was performed on a PhastSystem utilizing
10-15% gels (Pharmacia). Western blotting was accomplished by transfer of the
electrophoresed protein to nitrocellulose using the PhastTransfer system (Pharmacia),
as described by the manufacturer. The antisera to SAP and VEGF are used at a dilution
of 1:1000. Horseradish peroxidase labeled anti-IgG was used as the second
antibody (Davis et al. (1986) Basic Methods in Molecular Biology, New York, Elsevier
Science Publishing Co., pp 1-338).

2. Assays to assess the cytotoxicity of the VEGF-SAP fusion protein 

a. Effect of VEGF-SAP fusion protein on cell-free protein 
synthesis 

The RIP activity of VEGF fusion protein is assayed as described in
procedures for FGF conjugates (see, e.g., U.S. Patent No. 5,191,067).

b. Cytotoxicity of VEGF-SAP fusion protein

Cytotoxicity of the VEGF fusion protein is assayed as described in U.S.
Patent No. 5,191,067, except that vascular endothelial cells, such as human or bovine
aortic endothelial cells, are used. Briefly, the cells are seeded at a density of 1-5 x
10^ cells per well (in 24 well plates) and are incubated with varying concentrations of
the test protein at 37°C for 5-7 days. Prior to contacting with the test protein the
VEGF receptors can be upregulated, such as by replating or pretreating with VEGF.
The cells are then trypsinized and counted in a Coulter counter.



EXAMPLE 6 

Expression of VEGF and VEGF-SAP in the pPL-λ system

The VEGF121, VEGF165, VEGF121:SAP, VEGF165:SAP,
SAP:VEGF121, SAP:VEGF165 constructs are also expressed in the pPL-λ system
(Pharmacia Fine Chemicals; see, also, U.S. Patent No. 5,227,469). This system is
temperature inducible and directs the expressed protein to inclusion bodies thus
protecting the protein from degradation. The EcoRI and XbaI sites are used for
isolation of the VEGF or VEGF-SAP-encoding DNA from existing constructs.
Constructs are cloned into the unique HpaI site of pPL-lambda (see, e.g.,
Remaut et al. (1981) Gene 15:81, available from Pharmacia). The PL promoter, which
is controlled by the cI repressor of λ, can be thermo-regulated using a bacterial host
strain (N4830-1) containing the temperature-sensitive cI857 repressor. Induction is
effected by raising the temperature.

All cloning is done in a host such as N99cI+ and then transferred into
N4830-1 for induction. Thus, the plasmid containing the construct is introduced into the
host and grown at 30°C until A600 = 0.8-1.0. The temperature is raised to 42°C for 2
hours and the expressed protein is targeted to inclusion bodies.

For example, a plasmid, such as PZ70B1, is digested with XbaI/EcoRI to
release a fragment that contains the XbaI-ribosome binding site-VEGF165-SAP-T7
terminator-EcoRI site, and the ends are filled in with Klenow reagent. Plasmid pPL-λ is
digested with HpaI and the blunt-ended fragment is ligated into the digested plasmid.
The plasmid is introduced into the N4830-1, grown at 30°C and induced at 42°C. The
fusion protein is recovered from the inclusion bodies.

The inclusion bodies are released from the cells by concentrating the
cells, such as by centrifugation, and are resuspended in a buffer (~0.4-0.6 M salt). The
cells are lysed, either mechanically by homogenization or enzymatically, such as by
treatment with lysozyme or EDTA. Soluble materials are removed by sequential
centrifugation and resuspension or diafiltration. Further purification can be effected by
centrifugation in a sucrose gradient.

The purified inclusion body fraction is then solubilized and the residual
insoluble material is pelleted and discarded. Solubilization is effected using either
guanidine HCl 100 mM Tris, 150 mM NaCl, 50 mM EDTA and 50 mM EGTA.
Reducing agents, such as β-mercaptoethanol (0.1-0.3 M) and dithiothreitol (0.1 M) in
the presence of EDTA are also used to disrupt disulfide bonds. The soluble protein
fraction is recovered by centrifugation and diluted 10x into a buffer containing 100 mM
Tris, 10 mM EDTA, 1% monothioglycerol and 0.25 M L-arginine, pH 9.5. The mixture
is stored for 2 hours at 4°C and the centrifugation is repeated. Soluble protein is
dialyzed in PBS to remove the monothioglycerol.

An acid phosphatase based assay is used to determine the level of
proliferation induced by the addition of VEGF to human microvascular endothelial
cells (HMVEC). Cells were seeded on Collagen I coated 96-well plates at 2.5X10
cells/well in assay media. After overnight incubation, VEGF is added to each well in
assay media. In general, concentrations from 10^^ to lO'^^M of each test compound is
used. Cells are incubated for 3 days and fresh media containing the various VEGF
compounds is added. Cells are assayed by a standard acid phosphatase assay on day 6.

Protein synthesis inhibition is measured in a cell-free system. Samples
are diluted in PBS, and 5 µl of diluted sample is added to 5 µl of reaction buffer
containing 0.5 Brome mosaic virus RNA, 0.5 µl of a 1 mM amino acid mixture
lacking leucine, 12.5 µCi leucine, and 15 µl of a nuclease-treated rabbit reticulocyte
lysate. Samples are incubated for 1 hour at 30°C. The incubation is terminated by the
addition of 375 µl of 1 M NaOH in 2% H2O2. After 20 minutes, the volume is adjusted
to 1.6 ml with H2O and 50 µl of the assay mixture, in duplicate, is transferred to a
Multiscreen-HA 96 well filtration plate (Millipore, Bedford, MA, U.S.A.). Protein
contents of each well were precipitated by adding 250 µl of ice-cold 30%
trichloroacetic acid in 2% casamino acid. After incubation on ice for 30 min, TCA
precipitable material was collected by washing the wells three times with 250 µl of
ice-cold 5% trichloroacetic acid. After drying, filter paper circles were punched out of the
96-well plate, inserted into vials containing 5 ml of BetaMax scintillation fluid (ICN,
Costa Mesa, CA, U.S.A.) and counts were determined using a Beckman LS 6000SC
liquid scintillation counter.

As shown in Figure 1, both VEGF121 and VEGF165 were produced in
appreciable quantities at 2 hrs post-induction. Moreover, dimerization occurred after
refolding (Figure 2). VEGF121 and VEGF165 purified from inclusion bodies were able
to induce proliferation of HMVEC cells as assayed by an acid phosphatase assay.

HMVEC were grown for 72 hours in media lacking bFGF but which contained the test
compounds in varying concentrations. On day 3 (72 hrs), an acid phosphatase assay
was performed following standard procedures. Both the chemical conjugate of insect
cell derived VEGF165 conjugated to SAP (CCSV) and SAP-VEGF121 (FPSV)
produced from inclusion bodies in E. coli were able to inhibit the proliferation of
HMVEC in a dose dependent manner at concentrations as low as 10-^M as compared to
the level of stimulation seen with the addition of VEGF121 produced in insect cells.
FPSV is a more potent inhibitor of cellular proliferation than CCSV. CCSV = chemical
conjugate VEGF165-SAP; FPSV = SAP-VEGF121 made in E. coli from inclusion
bodies; VEGF121 = insect cell derived; Saporin = SAP. (Figure 7)

Expression of VEGF121-SAP and VEGF165-SAP was also appreciable
in the pPL-λ system (Figure 4), and the products dimerized (Figure 5). In addition,
VEGF121-SAP inhibited protein synthesis in a cell-free system, indicating that the SAP
portion of the conjugate retained its ribosome-inactivating activity (Figure 6).
EXAMPLE 7

Chemical Synthesis of VEGF-SAP and SAP-VEGF

About 50-100 nmol of a VEGF, which is dialyzed against phosphate-buffered
saline, is added to about 2.5 mg mono-derivatized SAP (a 1.5 molar excess
over the VEGF protein) and left on a rocker platform overnight. The ultraviolet-visible
wavelength spectrum is checked in order to determine the extent of reaction by the
release of pyridylthione, which absorbs at 343 nm with a known extinction coefficient.
The reaction mixtures are treated for purification in the following manner: the reaction
mixture is passed over a HiTrap heparin-Sepharose column (1 ml) equilibrated with
0.15 M sodium chloride in buffer A at a flow rate of 0.5 ml/min. The column is washed
with 0.6 M NaCl and 1.0 M NaCl in buffer A and the product eluted with 4.0 M NaCl
in buffer A. Fractions (0.5 ml) are analyzed by gel electrophoresis and absorbance at
280 nm. Peak tubes were pooled and dialyzed versus 10 mM sodium phosphate, pH
7.5 and applied to a Mono S 5/5 column equilibrated with the same buffer. A 10 ml
gradient between 0 and 1.0 M sodium chloride in equilibration buffer is used to elute
the product.
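
The extent-of-reaction check at 343 nm can be sketched as a Beer-Lambert calculation. The sketch below assumes a molar extinction coefficient of 8080 M^-1 cm^-1 for pyridine-2-thione at 343 nm (a commonly cited value that is not stated in this text), a 1 cm path length, and one thione released per disulfide bond formed. The function names and the example reading are hypothetical.

```python
# Assumed literature value for pyridine-2-thione at 343 nm (not given in the text).
EPSILON_343 = 8080.0  # M^-1 cm^-1


def thione_released_nmol(a343, volume_ml, path_cm=1.0, blank=0.0):
    """Amount of released pyridine-2-thione in nmol, from c = A / (epsilon * l)."""
    conc_molar = (a343 - blank) / (EPSILON_343 * path_cm)
    # mol/L * L gives mol; 1e9 converts mol to nmol
    return conc_molar * (volume_ml * 1e-3) * 1e9


def extent_of_reaction(a343, volume_ml, limiting_nmol, path_cm=1.0, blank=0.0):
    """Fraction of the limiting protein (here assumed to be VEGF) that has reacted."""
    released = thione_released_nmol(a343, volume_ml, path_cm, blank)
    return released / limiting_nmol


if __name__ == "__main__":
    # Hypothetical reading: A343 = 0.404 in a 1.0 ml reaction with 100 nmol VEGF.
    print(round(thione_released_nmol(0.404, 1.0), 2), "nmol thione released")
    print(round(extent_of_reaction(0.404, 1.0, 100.0), 3), "fraction reacted")
```

A buffer blank read at the same wavelength can be passed as `blank` so that protein absorbance near 343 nm is not counted as released thione.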



EXAMPLE 8

Preparation of VEGF-SAP Conjugates That Contain Linkers Encoding
Protease Substrates

A. Synthesis of oligos encoding protease substrates

Complementary single-stranded oligos, in which the sense strand
encodes a protease substrate, have been synthesized either using a Cyclone machine
(Millipore) according to the instructions provided by the manufacturer or, if greater than
80 bases, are made by Midland Certified Reagent Co. (Midland, TX). The following
oligos have been synthesized and can be introduced into constructs encoding
SAP:VEGF or VEGF:SAP as described above in EXAMPLES 3 and 4.

1. Cathepsin B substrate linker

5'- CCATGGCCCTGGCCCTGGCCCTGGCCCTGCCATGG -3' SEQ ID NO. 38

2. Cathepsin D substrate linker

5'- CCATGGGCCGATCGGGCTTCCTGGGCTTCGGCTTCCTGG
GCTTCGCCATGG -3' SEQ ID NO. 39

3. Trypsin substrate linker

5'- CCATGGGCCGATCGGGCGGTGGGTGCGCTGGTAATAGAGT
CAGAAGATCAGTCGGAAGCAGCCTGTCTTGCGGTGGTCTC
GACCTGCAGGCCATGG -3' SEQ ID NO. 44

4. Gly4Ser

5'- CCATGGGCGGCGGCGGCTCTGCCATGG -3' SEQ ID NO. 40

5. (Gly4Ser)2

5'- CCATGGGCGGCGGCGGCTCTGGCGGCGGCGGCTC
TGCCATGG -3' SEQ ID NO. 41

6. (Ser4Gly)4

5'- CCATGGCCTCGTCGTCGTCGGGCTCGTCGTCGTCGGGCT
CGTCGTCGTCGGGCTCGTCGTCGTCGGGCGCCATGG -3' SEQ ID NO. 42

7. (Ser4Gly)2

5'- CCATGGCCTCGTCGTCGTCGGGCTCGTCGTCGTC
GGGCGCCATGG -3' SEQ ID NO. 43

8. Thrombin substrate linker

CTG GTG CCG CGC GGC AGC SEQ ID NO. 45
Leu Val Pro Arg Gly Ser

9. Enterokinase substrate linker

GAC GAC GAC GAC CCA SEQ ID NO. 46
Asp Asp Asp Asp Lys

10. Factor Xa substrate

ATC GAA GGT CGT SEQ ID NO. 47
Ile Glu Gly Arg

11. Subtilisin linker

Xaa Ala His Tyr SEQ ID NO. 50, where Xaa is preferably Phe (see SEQ ID NO. 49).
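
The relationship between each linker oligo and the peptide it encodes can be checked by translating the sense strand with the standard genetic code. The following is a minimal sketch, not part of the described procedure. The function name `translate` and the choice of reading frame are assumptions for illustration: for the Nco I-flanked oligos the frame is taken to start at the ATG inside the 5' CCATGG site (offset 2).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, frame=0):
    """Translate a sense-strand DNA string (whitespace ignored) from a 0-based offset."""
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)


if __name__ == "__main__":
    print(translate("CTG GTG CCG CGC GGC AGC"))                  # thrombin site
    print(translate("ATCGAAGGTCGT"))                             # Factor Xa site
    print(translate("CCATGGGCGGCGGCGGCTCTGCCATGG", frame=2))     # Gly4Ser oligo
```

Under these assumptions the thrombin and Factor Xa oligos translate to the residues listed, and the Gly4Ser oligo reads Met-(Gly)4-Ser-Ala-Met across the two Nco I sites. Applied to the enterokinase entry as printed, the final codon CCA translates to Pro rather than the listed Lys, which may reflect a transcription error in that entry.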






B. Preparation of DNA constructs encoding SAP-Linker-VEGF

These constructs are prepared as described above for the
SAP-(Gly4Ser)x-VEGF conjugates, except that DNA encoding the desired protease substrate
is included in place of the DNA encoding (Gly4Ser)x.



EXAMPLE 9

CAM Assay for Angiogenesis Inhibition

Materials

Fertilized eggs are supplied by Melody Hill Ranch, Aptos, CA. L-[U-14C]
proline (specific activity, 290 mCi/mmole) is purchased from New England
Nuclear, Boston, MA. Type VII collagenase may be obtained from Sigma Chemical
Co., St. Louis, MO. Silicone ring cups are obtained by cutting silicone tubing (3 mm
diameter) into small "O" rings of 1 mm in thickness. These silicone ring cups can be
reused many times if they are sterilized prior to each assay.

Compound Preparation

VEGF protein and peptide-based compounds are dissolved in water
containing 0.5% methyl cellulose for testing. In general, 10 µl of protein solution is
implanted on each CAM.

Development of the CAM Assay for Angiogenesis Inhibition

The method of Folkman et al. (Developmental Biology 41:391-394,
1974), with minor modifications, is used to cultivate chicken embryos as follows:

Fresh fertile eggs are incubated for three days in a standard egg
incubator. On Day 3, eggs are cracked under sterile conditions and embryos are placed
into 20 x 100 mm plastic petri dishes and cultivated at 37°C in an embryo incubator
with a water reservoir on the bottom shelf. Air is continuously bubbled into the water
reservoir using a small pump such that the humidity in the incubator is kept constant.
On Day 9, a sterile silicone ring cup is placed on each CAM and 0.25 µCi of
14C-proline with or without the test materials dissolved in 0.5% methyl cellulose is
delivered into each ring cup in a sterile hood. Ten embryos will be used in all control
and test groups. After implantation of test materials, embryos are returned to the
incubator and cultivation continued. On Day 12, all embryos are transferred into a cold
room at 4-6°C. The antiangiogenic effect of each compound is first examined under the
microscope with 6x power followed by collagenase assay to give avascular zone
scoring and 14C-proline incorporation into collagenous protein respectively. All
embryos are kept on ice while scoring for avascular zone. Three color photographs will
be taken of representative CAMs from each group that demonstrate significant positive
responses.

Collagenase Assay for Measurement of 14C-Proline Incorporation in Collagenous
Protein

A piece of CAM 10 mm in diameter is cut off under each ring cup and
placed in a separate tube. 1.0 mL of phosphate-buffered saline (PBS, pH 7.3)
containing 0.11 and 0.17 mg of cycloheximide and dipyridyl respectively is added. The
tubes are placed in a boiling water bath for 10 minutes and then cooled to room
temperature. The PBS in each tube is discarded after centrifugation at 3000 x g for 10
minutes. The CAM residue is washed once with 3 mL of 15% TCA followed by three
times with 3 mL of 5% TCA. Centrifugation is carried out as above between each washing.
At this point, all non-protein bound radioactivity is removed and the CAM containing
the newly synthesized 14C-collagenous protein is suspended in 0.9 mL of 0.1 N NaOH and
1.1 mL of HEPES buffer at pH 7.4. The pH of the sample is neutralized with 0.8 N
HCl using phenol red as indicator.

To digest the 14C-collagenous protein, 7.5 units of collagenase and 500
nmoles of calcium chloride in 40 microliters of HEPES buffer are added to the above
samples, and the mixtures are incubated at 37°C for 4 hours. The reaction is stopped by
adding 1.0 mL of 20% TCA containing 5 mg of tannic acid into each tube. After
vortexing, the samples are centrifuged at 3000 x g for 10 minutes. An aliquot of the
clear supernatant is taken for scintillation counting to quantitate the radiolabeled
tripeptides corresponding to basement membrane collagen and other collagenous
materials synthesized by the CAM from 14C-proline. The CAM pellets in each tube are
solubilized in 0.5 mL of 1.0 N NaOH by boiling in a water bath for 5 minutes. An
aliquot of the dissolved CAM is used for protein determination using the method of
Lowry (J. Biol. Chem. 193:265-275, 1951). The radioactivity per mg of protein from
the CAM treated with a test compound relative to that from the control CAM gives the
percent of inhibition.
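
The final calculation can be sketched as follows. This is an illustrative reading of the sentence above, assuming that group means of counts per mg protein are compared and that inhibition is expressed as 100 x (1 - treated/control). The function names and the example numbers are hypothetical, not data from this document.

```python
def specific_activity(cpm, protein_mg):
    """Radioactivity per mg of CAM protein (cpm/mg)."""
    if protein_mg <= 0:
        raise ValueError("protein amount must be positive")
    return cpm / protein_mg


def percent_inhibition(treated, control):
    """Percent inhibition from lists of (cpm, mg protein) pairs, one pair per embryo."""
    def group_mean(samples):
        values = [specific_activity(cpm, mg) for cpm, mg in samples]
        return sum(values) / len(values)

    return 100.0 * (1.0 - group_mean(treated) / group_mean(control))


if __name__ == "__main__":
    # Hypothetical counts for two control and two treated embryos.
    control = [(1200.0, 0.6), (1000.0, 0.5)]   # 2000 cpm/mg each
    treated = [(500.0, 0.5), (600.0, 0.5)]     # 1000 and 1200 cpm/mg
    print(round(percent_inhibition(treated, control), 1), "% inhibition")
```

With ten embryos per group, as described above, each list would simply hold ten pairs.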

Since modifications will be apparent to those of skill in this art, it is
intended that this invention be limited only by the scope of the appended claims.
(1) GENERAL INFORMATION:

(i) APPLICANT : Barbara A. Sosnowski 

Kim Victor 
Lou Houston 
Michael Nova 

(ii) TITLE OF INVENTION: CONJUGATES OF VEGF WITH TARGETED AGENTS

(iii) NUMBER OF SEQUENCES: 103

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Brown, Martin, Haller & McClain 

(B) STREET: 1660 Union Street 

(C) CITY: San Diego 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 92101-2926 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 08/213,446

(B) FILING DATE: 15-MAR-1994 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 08/213,447 

(B) FILING DATE: 15-MAR-1994 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Seidman, Stephanie L. 

(B) REGISTRATION NUMBER: 33,779 

(C) REFERENCE/DOCKET NUMBER: 519522 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619)238-0999 

(B) TELEFAX: (619)238-0062

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iv) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: misc_recomb

(B) LOCATION: 6..11

(D) OTHER INFORMATION: /standard_name= "EcoRI Restriction Site"

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 12..30

(D) OTHER INFORMATION: /function= "N-terminal extension"
/product= "Native saporin signal peptide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CTGCAGAATT CGCATGGATC CTGCTTCAAT 30 
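
The LOCATION fields in this listing are 1-based and inclusive, so a feature can be read directly off the sequence line. The short sketch below (the helper name `feature` is hypothetical) extracts the two features annotated for SEQ ID NO:1, assuming the sequence as printed above.

```python
def feature(seq, start, end):
    """Return the bases at a 1-based, inclusive LOCATION such as 6..11."""
    bases = "".join(seq.split()).upper()
    if not (1 <= start <= end <= len(bases)):
        raise ValueError("location outside sequence")
    return bases[start - 1:end]


SEQ_ID_1 = "CTGCAGAATT CGCATGGATC CTGCTTCAAT"

if __name__ == "__main__":
    print(feature(SEQ_ID_1, 6, 11))    # misc_recomb: EcoRI recognition site
    print(feature(SEQ_ID_1, 12, 30))   # sig_peptide: start of the signal peptide
```

The first call returns GAATTC, the EcoRI recognition sequence named in the feature annotation.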

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 6.. 11 

(D) OTHER INFORMATION: /standard_name= "EcoRI Restriction Site" 

(ix) FEATURE: 

(A) NAME/KEY: terminator 

(B) LOCATION: 23.. 25 

(D) OTHER INFORMATION: /note= "Anti-sense stop codon"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 26..30

(D) OTHER INFORMATION: /note= "Anti-sense to carboxyl
terminus of mature peptide"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

CTGCAGAATT CGCCTCGTTT GACTACTTTG 30
(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..804

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..804

(D) OTHER INFORMATION: /note= "Nucleotide sequence
corresponding to the clone M13 mp18-G4 in Example I.B.2."

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 46..804

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48
Ala Trp Ile Leu Leu Gln Phe Ser Ala Trp Thr Thr Thr Asp Ala Val
-15 -10 -5 1

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96
Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser
5 10 15

TCT TTT GTG GAT AAA ATC CGA AAC AAT GTA AAG GAT CCA AAC CTG AAA 144
Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys
20 25 30

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192
Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys
35 40 45

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240
Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly
50 55 60 65

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn
70 75 80

ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC 336
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala
85 90 95

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala
100 105 110

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 432
Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile
115 120 125

ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 480
Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu
130 135 140 145

CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528
Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys
150 155 160

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA 576
Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val
165 170 175

GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624
Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn
180 185 190

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672
Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg
195 200 205

AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn
210 215 220 225

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu
230 235 240

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
245 250

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..804

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..804

(D) OTHER INFORMATION: /note= "Nucleotide sequence
corresponding to the clone M13 mp18-G1 in Example I.B.2."

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 46..804

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48
Ala Trp Ile Leu Leu Gln Phe Ser Ala Trp Thr Thr Thr Asp Ala Val
-15 -10 -5 1

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96
Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser
5 10 15

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144
Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys
20 25 30

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192
Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys
35 40 45

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240
Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly
50 55 60 65

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn
70 75 80

ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 336
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu Ile Thr Ser Ala
85 90 95

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala
100 105 110

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 432
Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile
115 120 125

ACA CAG GGA GAT AAA TCA AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 480
Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu
130 135 140 145

CTT TTG ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528
Leu Leu Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys
150 155 160

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA 576
Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val
165 170 175

GCA CGA TTT CGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624
Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn
180 185 190

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672
Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg
195 200 205

AAG ATT TCT ACG GCA ATA TAC GGA GAT GCC AAA AAC GGC GTG TTT AAT 720
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn
210 215 220 225

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu
230 235 240

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
245 250

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..804

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..804

(D) OTHER INFORMATION: /note= "Nucleotide sequence
corresponding to the clone M13 mp18-G2 in Example I.B.2."

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 46..804

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:



GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48
Ala Trp Ile Leu Leu Gln Phe Ser Ala Trp Thr Thr Thr Asp Ala Val
-15 -10 -5 1

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACT GCG GGT CAA TAC TCA 96
Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser
5 10 15

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144
Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys
20 25 30

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAT AAA 192
Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Asp Lys
35 40 45

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240
Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly
50 55 60 65

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn
70 75 80

ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC 336
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala
85 90 95

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala
100 105 110

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 432
Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile
115 120 125

ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 480
Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu
130 135 140 145

CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528
Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys
150 155 160

AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA 576
Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val
165 170 175

GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC 624
Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn
180 185 190

AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT 672
Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg
195 200 205

AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn
210 215 220 225

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu
230 235 240

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
245 250

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..804

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..804

(D) OTHER INFORMATION: /note= "Nucleotide sequence
corresponding to the clone M13 mp18-G7 in Example I.B.2."

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 46..804

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48
Ala Trp Ile Leu Leu Gln Phe Ser Ala Trp Thr Thr Thr Asp Ala Val
-15 -10 -5 1

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96
Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser
5 10 15

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144
Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys
20 25 30

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192
Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys
35 40 45

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240
Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly
50 55 60 65

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn
70 75 80

ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 336
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu Ile Thr Ser Ala
85 90 95

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala
100 105 110

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA 432
Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile
115 120 125

ACA CAG GGA GAT AAA TCA AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA 480
Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu
130 135 140 145

CTT TTG ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528
Leu Leu Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys
150 155 160

AAC GAA GCT AGA TTC CTT CTT ATC GCT ATT CAG ATG ACG GCT GAG GCA 576
Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Ala
165 170 175

GCA CGA TTT AGG TAC ATA CAA AAC TTG GTA ATC AAG AAC TTT CCC AAC 624
Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Ile Lys Asn Phe Pro Asn
180 185 190

AAG TTC AAC TCG GAA AAC AAA GTG ATT CAG TTT GAG GTT AAC TGG AAA 672
Lys Phe Asn Ser Glu Asn Lys Val Ile Gln Phe Glu Val Asn Trp Lys
195 200 205

AAA ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn
210 215 220 225

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu
230 235 240

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
245 250

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 804 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..804

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..804

(D) OTHER INFORMATION: /note= "Nucleotide sequence
corresponding to the clone M13 mp18-G9 in Example I.B.2."

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 46..804

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

GCA TGG ATC CTG CTT CAA TTT TCA GCT TGG ACA ACA ACT GAT GCG GTC 48
Ala Trp Ile Leu Leu Gln Phe Ser Ala Trp Thr Thr Thr Asp Ala Val
-15 -10 -5 1

ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA 96
Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser
5 10 15

TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA 144
Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys
20 25 30

TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA 192
Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys
35 40 45

TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC 240
Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly
50 55 60 65

CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC 288
Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn
70 75 80

ACG AAT GTT AAT CGG GCA TAT TAC TTC AGA TCA GAA ATT ACT TCC GCC 336
Thr Asn Val Asn Arg Ala Tyr Tyr Phe Arg Ser Glu Ile Thr Ser Ala
85 90 95

GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT 384
Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala
100 105 110

TTA GAA TAC ACA GAA GAT TAT CAG TCG ATT GAA AAG AAT GCC CAG ATA 432
Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile
115 120 125

ACA CAA GGA GAT CAA AGT AGA AAA GAA CTC GGG TTG GGG ATT GAC TTA 480
Thr Gln Gly Asp Gln Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu
130 135 140 145

CTT TCA ACG TCC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA 528
Leu Ser Thr Ser Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys
150 155 160

GAC GAA GCT AGA TTC CTT CTT ATC GCT ATT CAG ATG ACG GCT GAG GCA 576
Asp Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Ala
165 170 175

GCG CGA TTT AGG TAC ATA CAA AAC TTG GTA ATC AAG AAC TTT CCC AAC 624
Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Ile Lys Asn Phe Pro Asn
180 185 190

AAG TTC AAC TCG GAA AAC AAA GTG ATT CAG TTT GAG GTT AAC TGG AAA 672
Lys Phe Asn Ser Glu Asn Lys Val Ile Gln Phe Glu Val Asn Trp Lys
195 200 205

AAA ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT 720
Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn
210 215 220 225

AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG 768
Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu
230 235 240

CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 804
Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
245 250



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb

(B) LOCATION: 10..15

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme
recognition site"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 15..22

(D) OTHER INFORMATION: /product= "N-terminus of Saporin
protein"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CAACAACTGC CATGGTCACA TC 22 
(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: misc_recomb

(B) LOCATION: 11..16

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme
recognition site"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..10

(D) OTHER INFORMATION: /product= "Carboxy terminus of
mature FGF protein"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCTAAGAGCG CCATGGAGA 19 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..12

(D) OTHER INFORMATION: /product= "Carboxy terminus of
wild type FGF"

(ix) FEATURE:

(A) NAME/KEY: misc_recomb

(B) LOCATION: 13..18

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme
recognition site"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

GCT AAG AGC TGACCATGGA GA 21 
Ala Lys Ser 
1 

(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 102 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..96

(D) OTHER INFORMATION: /product= "pFGFNcoI"
/note= "Equals the plasmid pFC80 with native FGF
stop codon removed."

(ix) FEATURE:

(A) NAME/KEY: misc_recomb

(B) LOCATION: 29..34

(D) OTHER INFORMATION: /standard_name= "Nco I restriction enzyme
recognition site"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GAG ATC CGG CTG AAT 48
Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Glu Ile Arg Leu Asn
1 5 10 15

GGT GCA GTT CTG TAC CGG TTT TCC TGT GCC GTG TTT CAG GAC TCC TGAAATCTT 102
Gly Ala Val Leu Tyr Arg Phe Ser Cys Ala Val Phe Gln Asp Ser
20 25 30

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1230

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..465

(D) OTHER INFORMATION: /product= "bFGF"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 472..1230

(D) OTHER INFORMATION: /product= "Saporin"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:



ATG GCA GCA GGA TCA ATA ACA ACA TTA CCC GCC TTG CCC GAG GAT GGC 48 
Met Ala Ala Gly Ser lie Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly 
15 10 15 

GGC AGC GGC GCC TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 

20 25 30 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg lie His Pro Asp Gly Arg 
35 40 45 

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240 
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys

85 90 95 

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336 

Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 

100 105 110 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432 
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GTC ACA TCA 480 
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Val Thr Ser
145 150 155 160 



ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT 528
Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe

165 170 175

GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT 576
Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly

180 185 190 

GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT 624 
Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu
195 200 205 

AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA 672 
Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys
210 215 220 

CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT 720 
Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn 
225 230 235 240 

GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA 768 
Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu

245 250 255 

ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA 816 
Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu

260 265 270 

TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG 864 
Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln
275 280 285 

GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG 912 
Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu
290 295 300 

ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA 960 
Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu 
305 310 315 320

GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA 1008 
Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val Ala Arg

325 330 335 

TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC 1056 
Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe

340 345 350 

GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT 1104 
Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg Lys Ile
355 360 365 

TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT 1152 
Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp
370 375 380

TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG 1200 
Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu Gln Met
385 390 395 400

GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 1230
Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 

405 410 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1230 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1230

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..465 

(D) OTHER INFORMATION: /product= "bFGF"

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide

(B) LOCATION: 472..1230

(D) OTHER INFORMATION: /product= "Saporin"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ATG GCT GCT GGT TCT ATC ACT ACT CTG CCG GCT CTG CCG GAA GAC GGT 48
Met Ala Ala Gly Ser Ile Thr Thr Leu Pro Ala Leu Pro Glu Asp Gly
1 5 10 15 

GGT TCT GGT GCT TTC CCG CCC GGC CAC TTC AAG GAC CCC AAG CGG CTG 96 
Gly Ser Gly Ala Phe Pro Pro Gly His Phe Lys Asp Pro Lys Arg Leu 

20 25 30 

TAC TGC AAA AAC GGG GGC TTC TTC CTG CGC ATC CAC CCC GAC GGC CGA 144 
Tyr Cys Lys Asn Gly Gly Phe Phe Leu Arg Ile His Pro Asp Gly Arg
35 40 45 

GTT GAC GGG GTC CGG GAG AAG AGC GAC CCT CAC ATC AAG CTT CAA CTT 192 
Val Asp Gly Val Arg Glu Lys Ser Asp Pro His Ile Lys Leu Gln Leu
50 55 60 

CAA GCA GAA GAG AGA GGA GTT GTG TCT ATC AAA GGA GTG TGT GCT AAC 240 
Gln Ala Glu Glu Arg Gly Val Val Ser Ile Lys Gly Val Cys Ala Asn
65 70 75 80 

CGT TAC CTG GCT ATG AAG GAA GAT GGA AGA TTA CTG GCT TCT AAA TGT 288 
Arg Tyr Leu Ala Met Lys Glu Asp Gly Arg Leu Leu Ala Ser Lys Cys 

85 90 95

GTT ACG GAT GAG TGT TTC TTT TTT GAA CGA TTG GAA TCT AAT AAC TAC 336
Val Thr Asp Glu Cys Phe Phe Phe Glu Arg Leu Glu Ser Asn Asn Tyr 

100 105 110 

AAT ACT TAC CGG TCA AGG AAA TAC ACC AGT TGG TAT GTG GCA TTG AAA 384 
Asn Thr Tyr Arg Ser Arg Lys Tyr Thr Ser Trp Tyr Val Ala Leu Lys
115 120 125 

CGA ACT GGG CAG TAT AAA CTT GGA TCC AAA ACA GGA CCT GGG CAG AAA 432 
Arg Thr Gly Gln Tyr Lys Leu Gly Ser Lys Thr Gly Pro Gly Gln Lys
130 135 140 

GCT ATA CTT TTT CTT CCA ATG TCT GCT AAG AGC GCC ATG GTC ACA TCA 480 
Ala Ile Leu Phe Leu Pro Met Ser Ala Lys Ser Ala Met Val Thr Ser
145 150 155 160 

ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT 528
Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe

165 170 175 

GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT 576 
Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly

180 185 190 

GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT 624 
Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu
195 200 205 

AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA 672 
Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys
210 215 220 

CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT 720 
Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn 
225 230 235 240 

GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA 768 
Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu

245 250 255 

ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA 816 
Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu

260 265 270 

TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG 864 
Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln
275 280 285 

GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG 912 
Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu
290 295 300 

ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA 960 
Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu
305 310 315 320

GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA 1008
Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr Ala Glu Val Ala Arg

325 330 335

TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC 1056

Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe

340 345 350 

GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT 1104 

Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val Ser Trp Arg Lys Ile
355 360 365 

TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT 1152 

Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp
370 375 380

TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG 1200 

Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val Lys Asp Leu Gln Met

385 390 395 400 

GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 1230 

Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys 

405 410 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(D) OTHER INFORMATION: /product= trp promoter

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

AATTCCCCTG TTGACAATTA ATCATCGAAC TAGTTAACTA GTACGCAGCT TGGCTGCAG 59 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE:

(D) OTHER INFORMATION: /product= bacteriophage lambda cII ribosome
binding site

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GTCGACCAAG CTTGGGCATA CATTCAATCA ATTGTTATCT AAGGAAATAC TTACATATG 59 

(2) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..66

(D) OTHER INFORMATION: /product= VEGF gene EXON I (VEGF LEADER
SEQUENCE -26 - -5)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16

ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT GCC TTG CTG CTC 48 
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala Leu Leu Leu 
1 5 10 15

TAC CTC CAC CAT GCC AAG 66
Tyr Leu His His Ala Lys

20

(2) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 52 

(D) OTHER INFORMATION: /product= VEGF gene EXON II
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

TGG TCC CAG GCT GCA CCC ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC 48 
Trp Ser Gln Ala Ala Pro Met Ala Glu Gly Gly Gly Gln Asn His His
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 18

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: both



(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 197 

(D) OTHER INFORMATION: /product= VEGF gene EXON III
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

TG GTG AAG TTC ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC 47
Val Lys Phe Met Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile
1 5 10 15

GAG ACC CTG GTG GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC 95 
Glu Thr Leu Val Asp Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr

20 25 30 

ATC TTC AAG CCA TCC TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC 143 
Ile Phe Lys Pro Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys

35 40 45 

AAT GAC GAG GGC CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC 191 
Asn Asp Glu Gly Leu Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr

50 55 60



ATG CAG 197
Met Gln
64



(2) INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 75 

(D) OTHER INFORMATION: /product= VEGF gene EXON IV
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



ATT ATG CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC 48
Ile Met Arg Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser
110 115 120 

TTC CTA CAG CAC AAC AAA TGT GAA TGC AG 77 
Phe Leu Gln His Asn Lys Cys Glu Cys

125 130 135 140 

(2) INFORMATION FOR SEQ ID NO: 20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: genomic

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 27 

(D) OTHER INFORMATION: /product= VEGF gene EXON V 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

A CCA AAG AAA GAT AGA GCA AGA CAA GAA AA 30 
Pro Lys Lys Asp Arg Ala Arg Gln Glu

5 

(2) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: genomic

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 70 

(D) OTHER INFORMATION: /product= VEGF gene EXON VI
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21

A AAA TCA GTT CGA GGA AAG GGA AAG GGG CAA AAA CGA AAG CGC AAG AAA 49

Lys Ser Val Arg Gly Lys Gly Lys Gly Gln Lys Arg Lys Arg Lys Lys
1 5 10 15

TCC CGG TAT AAG TCC TGG AGC GT 72

Ser Arg Tyr Lys Ser Trp Ser 

20 



(2) INFORMATION FOR SEQ ID NO: 22

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: genomic

(ix) FEATURE: 

(A) NAME/KEY: CDS
(B) LOCATION: 1..51

(D) OTHER INFORMATION: /product= Insert between EXON VI & VII
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

TAC GTT GGT GCC CGC TGC TGT CTA ATG CCC TGG AGC CTC CCT GGC CCC 48

Tyr Val Gly Ala Arg Cys Cys Leu Met Pro Trp Ser Leu Pro Gly Pro
1 5 10 15

CAT 51
His

(2) INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 132 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: genomic

(ix) FEATURE: 

(A) NAME/KEY: CDS
(B) LOCATION: 2..130

(D) OTHER INFORMATION: /product= EXON VII
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

T CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA CAA GAT 49 
Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln Asp
1 5 10 15 

CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC AAG 97 
Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys

20 25 30 

GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AG 132 
Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys
35 40 

(2) INFORMATION FOR SEQ ID NO: 24 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 






(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: genomic 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . . 1? 

(D) OTHER INFORMATION: /product= EXON VIII 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

A TGT GAC AAG CCG AGG CGG TGA 22 
Cys Asp Lys Pro Arg Arg 
1 5

(2) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 456 

(D) OTHER INFORMATION: /product= "VEGF121-encoding DNA"

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 90 

(D) OTHER INFORMATION: /product= leader-encoding sequence
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48 

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu 
1 5 10

GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC 96 
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro
15 20 25 

ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 144 
Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met
30 35 40

GAT GTC TAT CAG CGG AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 192 
Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp
45 50 55 60

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 240

Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser

65 70 75 

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 288
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu

80 85 90 

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 336
Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 384
Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln
110 115 120 

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA 432
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu
125 130 135 140 

AAA TGT GAC AAG CCG AGG CGG TGATGAATGA ATGAGGATCC 473

Lys Cys Asp Lys Pro Arg Arg 

145 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13 . . 588 

(D) OTHER INFORMATION: /product= "VEGF165-encoding DNA"

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 13..90

(D) OTHER INFORMATION: /product= "leader sequence -encoding DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu
1 5 10

GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC 96
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro
15 20 25

ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 144 
Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met
30 35 40 

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 192
Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp
45 50 55 60 

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 240

Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser

65 70 75 

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 288
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu 

80 85 90 

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 336
Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 384
Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln
110 115 120 

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA 432
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu
125 130 135 140 

AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA CAA 480
Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln

145 150 155 

GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC 528
Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys

160 165 170 

AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC AAG 576
Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys
175 180 185 

CCG AGG CGG TGATGAATGA ATGAGGATCC 605
Pro Arg Arg 
190 

(2) INFORMATION FOR SEQ ID NO: 27 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 677 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 13.. 657 

(D) OTHER INFORMATION: /product= "VEGF189-encoding DNA"

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 13 . . 90 

(D) OTHER INFORMATION: /product= "leader sequence-encoding DNA"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48 

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu 
1 5 10

GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC 96 
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro
15 20 25 

ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 144 
Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met
30 35 40 

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 192 
Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp
45 50 55 60 

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 240

Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser

65 70 75

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 288 
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu

80 85 90 

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 336 
Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 384 
Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln
110 115 120 

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAG GAT AGA GCA AGA CAA GAA 432 
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu
125 130 135 140 

AAA AAA TCA GTT CGA GGA AAG GGA AAG GGG CAA AAA CGA AAG CGC AAG 480

Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gln Lys Arg Lys Arg Lys

145 150 155 



AAA TCC CGG TAT AAG TCC TGG AGC GTT CCC TGT GGG CCT TGC TCA GAG 528
Lys Ser Arg Tyr Lys Ser Trp Ser Val Pro Cys Gly Pro Cys Ser Glu

160 165 170

CGG AGA AAG CAT TTG TTT GTA CAA GAT CCG CAG ACG TGT AAA TGT TCC 576 
Arg Arg Lys His Leu Phe Val Gln Asp Pro Gln Thr Cys Lys Cys Ser
175 180 185

TGC AAA AAC ACA GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA AAC 624 
Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu Leu Asn
190 195 200

GAA CGT ACT TGC AGA TGT GAC AAG CCG AGG CGG TGATGAATGA ATGAGGATCC 677 
Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg 
205 210 215 



(2) INFORMATION FOR SEQ ID NO: 28 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 728 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13 . . 711 

(D) OTHER INFORMATION: /product= "VEGF206-encoding DNA"

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13..90

(D) OTHER INFORMATION: /product= leader sequence encoding DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48 

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu
1 5 10 



GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC 96
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro
15 20 25

ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 144
Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met
30 35 40

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 192
Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp

45 50 55

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 240
Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser

65 70 75

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 288
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu

80 85 90 

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 336 
Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 384 
Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln
110 115 120 

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAG GAT AGA GCA AGA CAA GAA 432 
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu
125 130 135 140 

AAA AAA TCA GTT CGA GGA AAG GGA AAG GGG CAA AAA CGA AAG CGC AAG 480 
Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gln Lys Arg Lys Arg Lys

145 150 155 

AAA TCC CGG TAT AAG TCC TGG AGC GTT TAC GTT GGT GCC CGC TGC TGT 528 
Lys Ser Arg Tyr Lys Ser Trp Ser Val Tyr Val Gly Ala Arg Cys Cys 

160 165 170 

CTA ATG CCC TGG AGC CTC CCT GGC CCC CAT CCC TGT GGG CCT TGC TCA 576 
Leu Met Pro Trp Ser Leu Pro Gly Pro His Pro Cys Gly Pro Cys Ser 
175 180 185 

GAG CGG AGA AAG CAT TTG TTT GTA CAA GAT CCG CAG ACG TGT AAA TGT 624 
Glu Arg Arg Lys His Leu Phe Val Gln Asp Pro Gln Thr Cys Lys Cys

190 195 200

TCC TGC AAA AAC ACA GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA 672 
Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu Leu
205 210 215 220 

AAC GAA CGT ACT TGC AGA TGT GAC AAG CCG AGG CGG TGATGAATGA 718
Asn Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg 
225 230 235 

ATGAGGATCC 728 
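
The exon lengths given in SEQ ID NOs: 16-24 can be compared with the CDS spans declared for the cDNAs of SEQ ID NOs: 25-28. The Python sketch below is illustrative only and is not part of the listing as filed. The exon composition assigned to each cDNA is an assumption made here for the check (the listing does not state it), the product names are read as VEGF121, VEGF165, VEGF189 and VEGF206, and the names `EXON_BP`, `ISOFORMS`, `DECLARED_CDS`, `exon_sum` and `declared_length` are hypothetical.

```python
# Exon lengths in base pairs, as stated in SEQ ID NOs: 16-24 above.
EXON_BP = {
    "I": 66, "II": 52, "III": 197, "IV": 77, "V": 30,
    "VI": 72, "VI-VII insert": 51, "VII": 132, "VIII": 22,
}

# ASSUMED exon composition of each cDNA (not stated in the listing).
ISOFORMS = {
    "VEGF121": ["I", "II", "III", "IV", "V", "VIII"],
    "VEGF165": ["I", "II", "III", "IV", "V", "VII", "VIII"],
    "VEGF189": ["I", "II", "III", "IV", "V", "VI", "VII", "VIII"],
    "VEGF206": ["I", "II", "III", "IV", "V", "VI", "VI-VII insert",
                "VII", "VIII"],
}

# CDS spans (first base, last base) declared in SEQ ID NOs: 25-28.
DECLARED_CDS = {
    "VEGF121": (13, 456),
    "VEGF165": (13, 588),
    "VEGF189": (13, 657),
    "VEGF206": (13, 711),
}


def exon_sum(name):
    """Total length of the exons assumed to make up the named cDNA."""
    return sum(EXON_BP[exon] for exon in ISOFORMS[name])


def declared_length(name):
    """Length of the declared CDS span, counting both end positions."""
    start, end = DECLARED_CDS[name]
    return end - start + 1


for name in ISOFORMS:
    print(name, exon_sum(name), declared_length(name))
```

Under these assumptions the exon sums agree with the declared spans for SEQ ID NOs: 25, 26 and 28. For SEQ ID NO: 27 the exon sum is three bases longer than the declared span 13..657, which would be consistent with the stop codon having been left out of that span.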
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 4..768

(D) OTHER INFORMATION: /product= "SAP CYS +4"

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 768 

(D) OTHER INFORMATION: /product= "mature SAP CYS +4"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

CAT ATG GTC ACA TCA TGT ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT 48 
Met Val Thr Ser Cys Thr Leu Asp Leu Val Asn Pro Thr Ala Gly 
1 5 10 15 

CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA 96
Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro

20 25 30 

AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT 144
Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser

35 40 45 

AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC 192 
Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val
50 55 60 

TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA 240 
Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala
65 70 75 

ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT 288 
Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile
80 85 90 95 

ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT 336 
Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn 

100 105 110 

CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT 384 
Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn

115 120 125 

GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG 432 

Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly
130 135 140 

ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT 480 
Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg
145 150 155

GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA 528 
Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr
160 165 170 175 






GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC 576 
Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn

180 185 190 



TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC 624
Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val

195 200 205



AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC 672 
Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly
210 215 220 

GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG 720 
Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val
225 230 235 

AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG TAG 768 
Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
240 245 250 255 



(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: cDNA



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4.. 768 

(D) OTHER INFORMATION: /product= "SAP CYS +10"

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 768 

(D) OTHER INFORMATION: /product= "mature SAP CYS +10"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CAT ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA TGT CCG ACC GCG GGT 48

Met Val Thr Ser Ile Thr Leu Asp Leu Val Cys Pro Thr Ala Gly
1 5 10 15 

CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA 96 
Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro

20 25 30 

AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT 144
Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser

35 40 45 

AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC 192 
Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val
50 55 60 

TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA 240 
Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala 
65 70 75 

ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT 288 
Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile
80 85 90 95 

ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT 336 
Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn 

100 105 110 

CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT 384 
Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn

115 120 125 

GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG 432 
Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly
130 135 140 

ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT 480 
Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg
145 150 155 

GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA 528
Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr
160 165 170 175 

GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC 576 
Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn

180 185 190 

TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC 624 
Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val

195 200 205 

AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC 672 
Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly
210 215 220 

GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG 720
Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val
225 230 235 



AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG TAG 768
Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
240 245 250 255

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 4.. 1212 

(D) OTHER INFORMATION: /product= "VEGF121-SAP LEADER
pZ1B"

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 4 . . 81 

(D) OTHER INFORMATION: /product= "LEADER"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CAT ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT GCC TTG CTG 48
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala Leu Leu
1 5 10 15

CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCA ATG GCA GAA 96 
Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Met Ala Glu

20 25 30 

GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG GAT GTC TAT 144 
Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met Asp Val Tyr

35 40 45 

CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC ATC TTC CAG 192 
Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp Ile Phe Gln
50 55 60

GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC TGT GTG CCC 240

Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser Cys Val Pro
65 70 75 

CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG GAG TGT GTG 288 
Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu Glu Cys Val 
80 85 90 95 

CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG ATC AAA CCT 336 
Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg Ile Lys Pro

100 105 110 



CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG CAC AAC AAA 384
His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln His Asn Lys

115 120 125

TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA AAA TGT GAC 432 
Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu Lys Cys Asp
130 135 140 

AAG CCG AGG CGG CCA TGG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT 480 
Lys Pro Arg Arg Pro Trp Val Thr Ser Ile Thr Leu Asp Leu Val Asn
145 150 155 

CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC 528 
Pro Thr Ala Gly Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn
160 165 170 175 

GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA 576 
Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp lie Ala Val lie 

180 185 190 

GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC 624 
Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg lie Asn Phe Gin Ser Ser 

195 200 205 

CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC 672 
Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val 
210 215 220 

GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC 720 
Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe 
225 230 235 

AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC 768 
Lys Ser Glu lie Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala 
240 245 250 255 

ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG 816 
Thr Thr Ala Asn Gin Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gin Ser 

260 265 270 

ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA 864 
He Glu Lys Asn Ala Gin He Thr Gin Gly Asp Lys Ser Arg Lys Glu 

275 280 285 

CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC 912 
Leu Gly Leu Gly He Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn 
290 295 300 

AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT 960 
Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu Leu He Ala 
305 310 315 

ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG 1008 
He Gin Met Thr Ala Glu Val Ala Arg Phe Arg Tyr He Gin Asn Leu 
320 325 330 335 



GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT 



1056 
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Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val lie 

340 345 350 

CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT 1104 
Gin Phe Glu Val Ser Trp Arg Lys He Ser Thr Ala He Tyr Gly Asp 

355 360 365 

GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA 1152 
Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys 
370 375 380 

GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC 1200 
Val Arg Gin Val Lys Asp Leu Gin Met Gly Leu Leu Met Tyr Leu Gly 
385 390 395 

AAA CCA AAG TAG 1212 

Lys Pro Lys 

400 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 4..1269

(D) OTHER INFORMATION: /product= "VEGF165-SAP NO LEADER pZ1B"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CAT ATG GCA CCA ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG 48
Met Ala Pro Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val
1 5 10 15

GTG AAG TTC ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG 96
Val Lys Phe Met Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu
20 25 30

ACC CTG GTG GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC 144
Thr Leu Val Asp Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile
35 40 45

TTC AAG CCA TCC TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT 192
Phe Lys Pro Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn
50 55 60

GAC GAG GGC CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG 240
Asp Glu Gly Leu Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met
65 70 75

CAG ATT ATG CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG 288
Gln Ile Met Arg Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met
80 85 90 95

AGC TTC CTA CAG CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA 336
Ser Phe Leu Gln His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg
100 105 110

GCA AGA CAA GAA AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT 384
Ala Arg Gln Glu Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His
115 120 125

TTG TTT GTA CAA GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA 432
Leu Phe Val Gln Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr
130 135 140

GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC 480
Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys
145 150 155

AGA TGT GAC AAG CCG AGG CGG CCA TGG GTC ACA TCA ATC ACA TTA GAT 528
Arg Cys Asp Lys Pro Arg Arg Pro Trp Val Thr Ser Ile Thr Leu Asp
160 165 170 175

CTA GTA AAT CCG ACC GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC 576
Leu Val Asn Pro Thr Ala Gly Gln Tyr Ser Ser Phe Val Asp Lys Ile
180 185 190

CGA AAC AAC GTA AAG GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA 624
Arg Asn Asn Val Lys Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp Ile
195 200 205

GCC GTG ATA GGC CCA CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC 672
Ala Val Ile Gly Pro Pro Ser Lys Glu Lys Phe Leu Arg Ile Asn Phe
210 215 220

CAA AGT TCC CGA GGA ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG 720
Gln Ser Ser Arg Gly Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu
225 230 235

TAT GTG GTC GCG TAT CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA 768
Tyr Val Val Ala Tyr Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala
240 245 250 255

TAT TAC TTC AAA TCA GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC 816
Tyr Tyr Phe Lys Ser Glu Ile Thr Ser Ala Glu Leu Thr Ala Leu Phe
260 265 270

CCA GAG GCC ACA ACT GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT 864
Pro Glu Ala Thr Thr Ala Asn Gln Lys Ala Leu Glu Tyr Thr Glu Asp
275 280 285

TAT CAG TCG ATC GAA AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT 912
Tyr Gln Ser Ile Glu Lys Asn Ala Gln Ile Thr Gln Gly Asp Lys Ser
290 295 300

AGA AAA GAA CTC GGG TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA 960
Arg Lys Glu Leu Gly Leu Gly Ile Asp Leu Leu Leu Thr Phe Met Glu
305 310 315

GCA GTG AAC AAG AAG GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG 1008
Ala Val Asn Lys Lys Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu
320 325 330 335

CTT ATC GCT ATT CAA ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT 1056
Leu Ile Ala Ile Gln Met Thr Ala Glu Val Ala Arg Phe Arg Tyr Ile
340 345 350

CAA AAC TTG GTA ACT AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC 1104
Gln Asn Leu Val Thr Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn
355 360 365

AAG GTG ATT CAA TTT GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA 1152
Lys Val Ile Gln Phe Glu Val Ser Trp Arg Lys Ile Ser Thr Ala Ile
370 375 380

TAC GGG GAT GCC AAA AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG 1200
Tyr Gly Asp Ala Lys Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly
385 390 395

TTT GGA AAA GTG AGG CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG 1248
Phe Gly Lys Val Arg Gln Val Lys Asp Leu Gln Met Gly Leu Leu Met
400 405 410 415

TAT TTG GGC AAA CCA AAG TAG 1269
Tyr Leu Gly Lys Pro Lys
420

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 12..1352

(D) OTHER INFORMATION: /product= "VEGF165-SAP LEADER BAC"

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 12..89

(D) OTHER INFORMATION: /product= "LEADER"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GGATCCGAAA C ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT GCC 50
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala
1 5 10

TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCA ATG 98
Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Met
15 20 25

GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG GAT 146
Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met Asp
30 35 40 45

GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC ATC 194
Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp Ile
50 55 60

TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC TGT 242
Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser Cys
65 70 75

GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG GAG 290
Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu Glu
80 85 90

TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG ATC 338
Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg Ile
95 100 105

AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG CAC 386
Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln His
110 115 120 125

AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA AAT 434
Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu Asn
130 135 140

CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA CAA GAT 482
Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln Asp
145 150 155

CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC AAG 530
Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys
160 165 170

GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC AAG CCG 578
Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys Pro
175 180 185

AGG CGG CCA TGG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC 626
Arg Arg Pro Trp Val Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr
190 195 200 205

GCG GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG 674
Ala Gly Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys
210 215 220

GAT CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA 722
Asp Pro Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro
225 230 235

CCT TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA 770
Pro Ser Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly
240 245 250

ACG GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT 818
Thr Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr
255 260 265

CTT GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA 866
Leu Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser
270 275 280 285

GAA ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT 914
Glu Ile Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr
290 295 300

GCA AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA 962
Ala Asn Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu
305 310 315

AAG AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG 1010
Lys Asn Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly
320 325 330

TTG GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG 1058
Leu Gly Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys
335 340 345

GCA CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA 1106
Ala Arg Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln
350 355 360 365

ATG ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT 1154
Met Thr Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr
370 375 380

AAG AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT 1202
Lys Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe
385 390 395

GAA GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA 1250
Glu Val Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys
400 405 410

AAC GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG 1298
Asn Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg
415 420 425

CAG GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA 1346
Gln Val Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro
430 435 440 445

AAG TAGTCAAACG AGGCCTGCAG 1369
Lys

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
CATATGTGTG TCACATCAAT CACATTAGAT 30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
CAGGTTTGGA TCCTTTACGT T 21
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 

AAGGAGATATACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC AGC 49
Met Gly Ser Ser His His His His His His Ser Ser
1 5 10

GGC CTG GTG CCG CGC GGC AGC CAT ATG CTC GAG GAT CCG 88
Gly Leu Val Pro Arg Gly Ser His Met Leu Glu Asp Pro
15 20 25



(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
AAA CAACGTAAA A6A TCCAAA CCTGAAA 23 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..35

(A) NAME/KEY: Cathepsin B linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
CCATGGCCCT GGCCCTGGCC CTGGCCCTGG CCATGG 36 
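
As an illustrative cross-check only (not part of the listing as filed), the linker oligonucleotides in this group can be tested for the flanking CCATGG (NcoI) sites and the stated length. The sketch below is a minimal example; the helper name `check_ncoi_linker` is hypothetical, and the input is the SEQ ID NO: 38 sequence as printed above.

```python
# Hypothetical helper: confirm that a linker oligo is flanked by NcoI
# recognition sites (CCATGG) and report its length in bases.
NCOI_SITE = "CCATGG"


def check_ncoi_linker(oligo: str) -> tuple[bool, int]:
    """Return (flanked_by_ncoi, length) for an oligo written in blocks of ten."""
    cleaned = "".join(oligo.split()).upper()  # drop the spaces between blocks
    flanked = cleaned.startswith(NCOI_SITE) and cleaned.endswith(NCOI_SITE)
    return flanked, len(cleaned)


# SEQ ID NO: 38 as printed in the listing (Cathepsin B linker).
SEQ_38 = "CCATGGCCCT GGCCCTGGCC CTGGCCCTGG CCATGG"
result = check_ncoi_linker(SEQ_38)
print(result)
```

The same call can be applied to the other NcoI-ended linkers in the listing; the sketch only checks the ends and the length and says nothing about the reading frame.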
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 3..50

(A) NAME/KEY: Cathepsin D linker 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
CCATGGGCCG ATCGGGCTTC CTGGGCTTCG GCTTCCTGGG CTTCGCCATGG 51 
(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..26

(A) NAME/KEY: Gly4Ser with NcoI ends
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40
CCATGGGCGG CGGCGGCTCT GCCATGG 27
(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..41

(A) NAME/KEY: (Gly4Ser)2 with NcoI ends
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41
CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCCAT GG 42
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..74

(A) NAME/KEY: (Ser4Gly)4 with NcoI ends
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42
CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCTC GTCGTCGTCG GGCTCGTCGT 60
CGTCGGGCGC CATGG 75



WO 96/06641 PCT/US95/10973



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..45

(A) NAME/KEY: (Ser4Gly)2

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43
CCATGGCCTC GTCGTCGTCG GGCTCGTCGT CGTCGGGCGC CATGG 45

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..95

(A) NAME/KEY: "Trypsin linker"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44
CCATGGGCCG ATCGGGCGGT GGGTGCGCTG GTAATAGAGT CAGAAGATCA GTCGGAAGCA 60

GCCTGTCTTG CGGTGGTCTC GACCTGCAGG CCATGG 96 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..18

(D) OTHER INFORMATION: /product= Thrombin substrate linker

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45

CTG GTG CCG CGC GGC AGC 
Leu Val Pro Arg Gly Ser 
1 5 
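
As an illustrative cross-check only (not part of the listing as filed), the codon-to-residue pairing printed for a short linker can be reproduced by translating the DNA with the standard genetic code. The sketch below is a minimal example under that assumption; `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical names introduced here.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG codon order
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Stop",
}


def translate(dna: str) -> list[str]:
    """Translate an in-frame DNA string into three-letter residue names."""
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3:
        raise ValueError("sequence length is not a multiple of three")
    codons = (cleaned[i:i + 3] for i in range(0, len(cleaned), 3))
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codons]


# SEQ ID NO: 45 as printed in the listing (thrombin substrate linker).
print(" ".join(translate("CTG GTG CCG CGC GGC AGC")))
```

Running the same function over any in-frame row of the longer sequences gives a quick way to spot residues or codons that were altered in transcription.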

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..15
(D) OTHER INFORMATION: /product= Enterokinase substrate linker

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 

GAC GAC GAC GAC CCA 
Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..12

(D) OTHER INFORMATION: /product= Factor Xa substrate
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47

ATC GAA GGT CGT 12
Ile Glu Gly Arg
1

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..8

(D) OTHER INFORMATION: /product= Flexible linker
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48 



Ala Ala Pro Ala Ala Ala Pro Ala 
1 5 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..4 

(D) OTHER INFORMATION: /product= subtilisin substrate linker

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49
Phe Ala His Tyr



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..4 

(D) OTHER INFORMATION: /product= subtilisin substrate linker

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

Xaa Asp Glu Leu 4 
1 



(2) INFORMATION FOR SEQ ID NO: 51: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 
CCATGGCACC AATGGCAGAA GGAGGA 26 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
GTCGACTCAT CACCGCCTCG GCTT 24 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 
CCATGGGCGG CGGCGGCTCT GCACCAATGG CAGAAGGA 38 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 53 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 
CCATGGGCGG CGGCGGCTCT GGCGGCGGCG GCTCTGCACC AATGGCAGAA GGA 53 
(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 
CCATGGCAGA GCCGCCGCCG CCCCGCCTCG GCTTGTCACA T 41 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 
CCATGGCAGA GCCGCCGCCG CCAGAGCCGC CGCCGCCCCG CCTCGGCTTG TCACAT 56 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 4..1155

(D) OTHER INFORMATION: /product= "SAP-(Gly4Ser)-VEGF121"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

CAT ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT 48
Met Val Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly
1 5 10 15

CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA 96
Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro
20 25 30

AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT 144
Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser
35 40 45

AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC 192
Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val
50 55 60

TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA 240
Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala
65 70 75

ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT 288
Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile
80 85 90 95

ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT 336
Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn
100 105 110

CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT 384
Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn
115 120 125

GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG 432
Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly
130 135 140

ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT 480
Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg
145 150 155

GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA 528
Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr
160 165 170 175

GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC 576
Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn
180 185 190

TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC 624
Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val
195 200 205

AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC 672
Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly
210 215 220

GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG 720
Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val
225 230 235

AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG GCC 768
Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys Ala
240 245 250 255

ATG GGC GGC GGC GGC TCT GCC ATG GCA CCA ATG GCA GAA GGA GGA GGG 816
Met Gly Gly Gly Gly Ser Ala Met Ala Pro Met Ala Glu Gly Gly Gly
260 265 270

CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG GAT GTC TAT CAG CGC AGC 864
Gln Asn His His Glu Val Val Lys Phe Met Asp Val Tyr Gln Arg Ser
275 280 285

TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC ATC TTC CAG GAG TAC CCT 912
Tyr Cys His Pro Ile Glu Thr Leu Val Asp Ile Phe Gln Glu Tyr Pro
290 295 300

GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC TGT GTG CCC CTG ATG CGA 960
Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser Cys Val Pro Leu Met Arg
305 310 315

TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG GAG TGT GTG CCC ACT GAG 1008
Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu Glu Cys Val Pro Thr Glu
320 325 330 335

GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG ATC AAA CCT CAC CAA GGC 1056
Glu Ser Asn Ile Thr Met Gln Ile Met Arg Ile Lys Pro His Gln Gly
340 345 350

CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG CAC AAC AAA TGT GAA TGC 1104
Gln His Ile Gly Glu Met Ser Phe Leu Gln His Asn Lys Cys Glu Cys
355 360 365

AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA AAA TGT GAC AAG CCG AGG 1152
Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu Lys Cys Asp Lys Pro Arg
370 375 380

CGG TGATGAGTCG AC 1167
Arg



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 4..1287

(D) OTHER INFORMATION: /product= "SAP-(Gly4Ser)-VEGF165"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

CAT ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT 48
Met Val Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly
1 5 10 15

CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA 96
Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro
20 25 30

AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT 144
Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser
35 40 45

AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC 192
Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val
50 55 60

TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA 240
Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala
65 70 75

ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT 288
Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile
80 85 90 95

ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT 336
Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn
100 105 110

CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT 384
Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn
115 120 125

GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG 432
Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly
130 135 140

ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT 480
Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg
145 150 155

GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA 528
Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr
160 165 170 175

GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC 576
Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn
180 185 190

TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC 624
Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val
195 200 205

AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC 672
Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly
210 215 220

GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG 720
Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val
225 230 235

AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG GCC 768
Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys Ala
240 245 250 255

ATG GGC GGC GGC GGC TCT GCC ATG GCA CCA ATG GCA GAA GGA GGA GGG 816
Met Gly Gly Gly Gly Ser Ala Met Ala Pro Met Ala Glu Gly Gly Gly
260 265 270

CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG GAT GTC TAT CAG CGC AGC 864
Gln Asn His His Glu Val Val Lys Phe Met Asp Val Tyr Gln Arg Ser
275 280 285

TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC ATC TTC CAG GAG TAC CCT 912
Tyr Cys His Pro Ile Glu Thr Leu Val Asp Ile Phe Gln Glu Tyr Pro
290 295 300

GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC TGT GTG CCC CTG ATG CGA 960
Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser Cys Val Pro Leu Met Arg
305 310 315

TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG GAG TGT GTG CCC ACT GAG 1008
Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu Glu Cys Val Pro Thr Glu
320 325 330 335

GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG ATC AAA CCT CAC CAA GGC 1056
Glu Ser Asn Ile Thr Met Gln Ile Met Arg Ile Lys Pro His Gln Gly
340 345 350

CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG CAC AAC AAA TGT GAA TGC 1104
Gln His Ile Gly Glu Met Ser Phe Leu Gln His Asn Lys Cys Glu Cys
355 360 365

AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA AAT CCC TGT GGG CCT TGC 1152
Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu Asn Pro Cys Gly Pro Cys
370 375 380

TCA GAG CGG AGA AAG CAT TTG TTT GTA CAA GAT CCG CAG ACG TGT AAA 1200
Ser Glu Arg Arg Lys His Leu Phe Val Gln Asp Pro Gln Thr Cys Lys
385 390 395

TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG 1248
Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu
400 405 410 415

TTA AAC GAA CGT ACT TGC AGA TGT GAC AAG CCG AGG CGG TGATGAGTCG 1297
Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg
420 425

AC 1299

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 771 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 4..771

(D) OTHER INFORMATION: /product= "SAP CYS -1"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

CAT ATG TGT GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG 48
Met Cys Val Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala
1 5 10 15

GGT CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT 96
Gly Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp
20 25 30

CCA AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT 144
Pro Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro
35 40 45

TCT AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG 192
Ser Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr
50 55 60

GTC TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT 240
Val Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu
65 70 75

GCA ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA 288
Ala Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu
80 85 90 95

ATT ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA 336
Ile Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala
100 105 110

AAT CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG 384
Asn Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys
115 120 125

AAT GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG 432
Asn Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu
130 135 140

GGG ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA 480
Gly Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala
145 150 155

CGT GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG 528
Arg Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met
160 165 170 175

ACA GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG 576
Thr Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys
180 185 190

AAC TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA 624
Asn Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu
195 200 205

GTC AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC 672
Val Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn
210 215 220

GGC GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG 720
Gly Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln
225 230 235

GTG AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG 768
Val Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys
240 245 250 255

TAG 771
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
TCCCAGGCTG CACCAATGGC AGAAGGAGGA 30 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 bases 
(B> TYPE: nucleic acid 
(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
<ii) MOUECUIiE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY : CDS 

(B) LOCATION: 1-9 

(D) OTHER INFORMATION: /product= "3' linking oligo for insertion
into 11a/15b"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
TCCTCCTTCT GCCATTGGTG CAGCCTGGGA 30 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CATATGAACT TTCTGCTGTC TTGG 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
CATATGGCAC CAATGGCAGA AGGAGGAGG 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GGATCCTCAT CACCGCCTCG GCTT 24
(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 
CCATGGCCGC CTCGGCTTGT C 
(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66 
GGATCCGAAA CATGAACTTT CTGCTGTCT 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 
GGATCCGAAA CATATGAACT TTCTGCTGTC T 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 
CTGCAGTCAT CACCGCCTCG GCTT
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CATATGGTCA CATCATGTAC ATTAGATCTA GTAAAT 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CATATGGTCA CATCAATCAC ATTAGATCTA GTATGTCCGA CCGCGGGTCA 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:
CTGCAGGCCT CCTTTGACTTA CTT 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 
(A) NAME/KEY: CDS

(B) LOCATION: 1..7

(D) OTHER INFORMATION: /product= nuclear translocation sequence
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 



Ala Pro Arg Arg Arg Lys Leu
1 5 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
TTTCAGGTTT GGATCTTTTA CGTTGTTT 28 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GGATCCGCCT CGTTTGACTA CTT 23 
(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..6 

(D) OTHER INFORMATION: /product= nuclear translocation sequence
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75 



Ile Lys Arg Leu Arg Arg
(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..6 

(D) OTHER INFORMATION: /product= nuclear translocation sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76

Ile Lys Arg Gln Arg Arg
1 5 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(D) OTHER INFORMATION: /product= "SO- A"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Val Ile Ile Tyr Glu Leu Asn Leu Gln Gly Thr Thr Lys Ala Gln Tyr
1 5 10 15

Ser Thr Ile Leu Lys Gln Leu Arg Asp Asp Ile Lys Asp Pro Asn Leu

20 25 30 

Xaa Tyr Gly Xaa Xaa Asp Tyr Ser 
35 40 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1545 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 4..1545

(D) OTHER INFORMATION: /product= "SAP-(Gly4Ser)-VEGF121-(Gly4Ser)-VEGF121"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

CAT ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT 48
Met Val Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly
1 5 10 15

CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA 96 
Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro

20 25 30 

AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT 144 
Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser

35 40 45 

AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC 192 
Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val
50 55 60 

TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA 240
Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala 
65 70 75 

ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT 288 
Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile
80 85 90 95 

ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT 336 
Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn

100 105 110 

CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT 384 
Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn

115 120 125 

GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG 432
Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly
130 135 140

ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT 480
Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg
145 150 155 

GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA 528 
Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr
160 165 170 175
GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC 576
Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn

180 185 190 

TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC 624 
Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val

195 200 205 

AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC 672 
Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly
210 215 220 

GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG 720 
Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val
225 230 235 

AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG GCC 768 
Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys Ala
240 245 250 255 

ATG GGC GGC GGC GGC TCT GCC ATG GCA CCA ATG GCA GAA GGA GGA GGG 816 
Met Gly Gly Gly Gly Ser Ala Met Ala Pro Met Ala Glu Gly Gly Gly 

260 265 270 

CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG GAT GTC TAT CAG CGC AGC 864 
Gln Asn His His Glu Val Val Lys Phe Met Asp Val Tyr Gln Arg Ser

275 280 285 

TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC ATC TTC CAG GAG TAC CCT 912 
Tyr Cys His Pro Ile Glu Thr Leu Val Asp Ile Phe Gln Glu Tyr Pro
290 295 300 

GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC TGT GTG CCC CTG ATG CGA 960
Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser Cys Val Pro Leu Met Arg
305 310 315 

TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG GAG TGT GTG CCC ACT GAG 1008 
Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu Glu Cys Val Pro Thr Glu
320 325 330 335 

GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG ATC AAA CCT CAC CAA GGC 1056 
Glu Ser Asn Ile Thr Met Gln Ile Met Arg Ile Lys Pro His Gln Gly

340 345 350 



CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG CAC AAC AAA TGT GAA TGC 1104
Gln His Ile Gly Glu Met Ser Phe Leu Gln His Asn Lys Cys Glu Cys

355 360 365 

AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA AAA TGT GAC AAG CCG AGG 1152
Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu Lys Cys Asp Lys Pro Arg
370 375 380

CGG GCC ATG GGC GGC GGC GGC TCT GCC ATG GCA CCA ATG GCA GAA GGA 1200
Arg Ala Met Gly Gly Gly Gly Ser Ala Met Ala Pro Met Ala Glu Gly 
385 390 395 
GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG GAT GTC TAT CAG 1248
Gly Gly Gln Asn His His Glu Val Val Lys Phe Met Asp Val Tyr Gln
400 405 410 415 

CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC ATC TTC CAG GAG 1296
Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp Ile Phe Gln Glu

420 425 430 

TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC TGT GTG CCC CTG 1344 
Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser Cys Val Pro Leu

435 440 445 

ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG GAG TGT GTG CCC 1392 
Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu Glu Cys Val Pro 
450 455 460 

ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG ATC AAA CCT CAC 1440
Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg Ile Lys Pro His
465 470 475 

CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG CAC AAC AAA TGT 1488 
Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln His Asn Lys Cys
480 485 490 495 

GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA AAA TGT GAC AAG 1536 
Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu Lys Cys Asp Lys

500 505 510 

CCG AGG CGG TGATGAGTCG AC 1557 
Pro Arg Arg 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1809 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 4..1797

(D) OTHER INFORMATION: /product= "SAP-(Gly4Ser)-VEGF165-(Gly4Ser)-VEGF165"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

CAT ATG GTC ACA TCA ATC ACA TTA GAT CTA GTA AAT CCG ACC GCG GGT 48 
Met Val Thr Ser Ile Thr Leu Asp Leu Val Asn Pro Thr Ala Gly
1 5 10 15

CAA TAC TCA TCT TTT GTG GAT AAA ATC CGA AAC AAC GTA AAG GAT CCA 96
Gln Tyr Ser Ser Phe Val Asp Lys Ile Arg Asn Asn Val Lys Asp Pro

20 25 30 

AAC CTG AAA TAC GGT GGT ACC GAC ATA GCC GTG ATA GGC CCA CCT TCT 144 
Asn Leu Lys Tyr Gly Gly Thr Asp Ile Ala Val Ile Gly Pro Pro Ser

35 40 45 

AAA GAA AAA TTC CTT AGA ATT AAT TTC CAA AGT TCC CGA GGA ACG GTC 192 
Lys Glu Lys Phe Leu Arg Ile Asn Phe Gln Ser Ser Arg Gly Thr Val
50 55 60 

TCA CTT GGC CTA AAA CGC GAT AAC TTG TAT GTG GTC GCG TAT CTT GCA 240 
Ser Leu Gly Leu Lys Arg Asp Asn Leu Tyr Val Val Ala Tyr Leu Ala 
65 70 75 

ATG GAT AAC ACG AAT GTT AAT CGG GCA TAT TAC TTC AAA TCA GAA ATT 288 
Met Asp Asn Thr Asn Val Asn Arg Ala Tyr Tyr Phe Lys Ser Glu Ile
80 85 90 95 

ACT TCC GCC GAG TTA ACC GCC CTT TTC CCA GAG GCC ACA ACT GCA AAT 336 
Thr Ser Ala Glu Leu Thr Ala Leu Phe Pro Glu Ala Thr Thr Ala Asn 

100 105 110 

CAG AAA GCT TTA GAA TAC ACA GAA GAT TAT CAG TCG ATC GAA AAG AAT 384 
Gln Lys Ala Leu Glu Tyr Thr Glu Asp Tyr Gln Ser Ile Glu Lys Asn

115 120 125 

GCC CAG ATA ACA CAG GGA GAT AAA AGT AGA AAA GAA CTC GGG TTG GGG 432 
Ala Gln Ile Thr Gln Gly Asp Lys Ser Arg Lys Glu Leu Gly Leu Gly
130 135 140 

ATC GAC TTA CTT TTG ACG TTC ATG GAA GCA GTG AAC AAG AAG GCA CGT 480 
Ile Asp Leu Leu Leu Thr Phe Met Glu Ala Val Asn Lys Lys Ala Arg
145 150 155 

GTG GTT AAA AAC GAA GCT AGG TTT CTG CTT ATC GCT ATT CAA ATG ACA 528 
Val Val Lys Asn Glu Ala Arg Phe Leu Leu Ile Ala Ile Gln Met Thr
160 165 170 175 

GCT GAG GTA GCA CGA TTT AGG TAC ATT CAA AAC TTG GTA ACT AAG AAC 576 
Ala Glu Val Ala Arg Phe Arg Tyr Ile Gln Asn Leu Val Thr Lys Asn

180 185 190 

TTC CCC AAC AAG TTC GAC TCG GAT AAC AAG GTG ATT CAA TTT GAA GTC 624 
Phe Pro Asn Lys Phe Asp Ser Asp Asn Lys Val Ile Gln Phe Glu Val

195 200 205 

AGC TGG CGT AAG ATT TCT ACG GCA ATA TAC GGG GAT GCC AAA AAC GGC 672 
Ser Trp Arg Lys Ile Ser Thr Ala Ile Tyr Gly Asp Ala Lys Asn Gly

210 215 220 



GTG TTT AAT AAA GAT TAT GAT TTC GGG TTT GGA AAA GTG AGG CAG GTG 720
Val Phe Asn Lys Asp Tyr Asp Phe Gly Phe Gly Lys Val Arg Gln Val
225 230 235 

AAG GAC TTG CAA ATG GGA CTC CTT ATG TAT TTG GGC AAA CCA AAG GCC 768 
Lys Asp Leu Gln Met Gly Leu Leu Met Tyr Leu Gly Lys Pro Lys Ala
240 245 250 255 

ATG GGC GGC GGC GGC TCT GCA CCA ATG GCA GAA GGA GGA GGG CAG AAT 816 
Met Gly Gly Gly Gly Ser Ala Pro Met Ala Glu Gly Gly Gly Gln Asn

260 265 270 

CAT CAC GAA GTG GTG AAG TTC ATG GAT GTC TAT CAG CGC AGC TAC TGC 864 
His His Glu Val Val Lys Phe Met Asp Val Tyr Gln Arg Ser Tyr Cys

275 280 285 

CAT CCA ATC GAG ACC CTG GTG GAC ATC TTC CAG GAG TAC CCT GAT GAG 912 
His Pro Ile Glu Thr Leu Val Asp Ile Phe Gln Glu Tyr Pro Asp Glu
290 295 300

ATC GAG TAC ATC TTC AAG CCA TCC TGT GTG CCC CTG ATG CGA TGC GGG 960
Ile Glu Tyr Ile Phe Lys Pro Ser Cys Val Pro Leu Met Arg Cys Gly
305 310 315 

GGC TGC TGC AAT GAC GAG GGC CTG GAG TGT GTG CCC ACT GAG GAG TCC 1008 
Gly Cys Cys Asn Asp Glu Gly Leu Glu Cys Val Pro Thr Glu Glu Ser 
320 325 330 335 

AAC ATC ACC ATG CAG ATT ATG CGG ATC AAA CCT CAC CAA GGC CAG CAC 1056 
Asn Ile Thr Met Gln Ile Met Arg Ile Lys Pro His Gln Gly Gln His

340 345 350 

ATA GGA GAG ATG AGC TTC CTA CAG CAC AAC AAA TGT GAA TGC AGA CCA 1104
Ile Gly Glu Met Ser Phe Leu Gln His Asn Lys Cys Glu Cys Arg Pro

355 360 365 

AAG AAA GAT AGA GCA AGA CAA GAA AAT CCC TGT GGG CCT TGC TCA GAG 1152 

Lys Lys Asp Arg Ala Arg Gln Glu Asn Pro Cys Gly Pro Cys Ser Glu
370 375 380 

CGG AGA AAG CAT TTG TTT GTA CAA GAT CCG CAG ACG TGT AAA TGT TCC 1200 
Arg Arg Lys His Leu Phe Val Gln Asp Pro Gln Thr Cys Lys Cys Ser
385 390 395 

TGC AAA AAC ACA GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA AAC 1248 
Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu Leu Asn
400 405 410 415 

GAA CGT ACT TGC AGA TGT GAC AAG CCG AGG CGG GGC GGC GGC GGC TCT 1296 
Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg Gly Gly Gly Gly Ser 

420 425 430 

GCC ATG GCA CCA ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG 1344 
Ala Met Ala Pro Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val

435 440 445 
GTG AAG TTC ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG 1392
Val Lys Phe Met Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu
450 455 460 

ACC CTG GTG GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC 1440
Thr Leu Val Asp Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile
465 470 475 

TTC AAG CCA TCC TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT 1488

Phe Lys Pro Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn 
480 485 490 495 

GAC GAG GGC CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG 1536 
Asp Glu Gly Leu Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met

500 505 510 

CAG ATT ATG CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG 1584 
Gln Ile Met Arg Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met

515 520 525 

AGC TTC CTA CAG CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA 1632 
Ser Phe Leu Gln His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg
530 535 540 

GCA AGA CAA GAA AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT 1680 
Ala Arg Gln Glu Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His
545 550 555 

TTG TTT GTA CAA GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA 1728 
Leu Phe Val Gln Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr
560 565 570 575 

GAC TCG CGT TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC 1776 
Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys

580 585 590 

AGA TGT GAC AAG CCG AGG CGG TGATGAGTCG AC 1809
Arg Cys Asp Lys Pro Arg Arg 

595 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80
TGGTCCCAGGCTGCACCC ATGTGTGAAGGAGGAGGGCAGAATCAT 30



(2) INFORMATION FOR SEQ ID NO: 81: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81 
ATGATTCTGCCCTCCTCCTTCACACATG GGTGCAGCCTGGGACCA 30
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 
GCCAAGTGGTCCCAGG CTGCATGTCCCATGGCAGAAGG AGGAGGGCAG 30

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
CTGCCCTCCTCCTTCTGCCATGGG ACATGCAGCCTGGGACCACTTGGC 30

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..5

(D) OTHER INFORMATION: /product= nuclear translocation sequence
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84

Ile Arg Val Arg Arg
1 5 

(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 

(D) OTHER INFORMATION: /product= nuclear translocation sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85 

Lys Arg Lys Arg Lys Lys 
1 5

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 12..455

(D) OTHER INFORMATION: /product= "VEGF121 Cys+4"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

GGATCCGAAA C ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT GCC 50

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala 
1 5 10 

TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC ATG 98
Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Met
15 20 25 

TGT GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 146
Cys Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met
30 35 40 45

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 194 
Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp

50 55 60 

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 242 
Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser

65 70 75 

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 290 
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu 
80 85 90 

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 338
Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 386 
Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln
110 115 120 125 

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA 434 
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu

130 135 140 

AAA TGT GAC AAG CCG AGG CGG TGATGACTGC AG 467 

Lys Cys Asp Lys Pro Arg Arg 

145 



(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown



(ii) MOLECULE TYPE: cDNA



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 12..587

(D) OTHER INFORMATION: /product= "VEGF165 Cys+4"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

GGATCCGAAA C ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT GCC 50

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala 
1 5 10 

TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA CCC ATG 98 
Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Met
15 20 25 

TGT GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 146
Cys Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met
30 35 40 45 

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 194 
Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp
50 55 60

ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 242 
Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser

65 70 75 

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 290 
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu 
80 85 90 

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 338 
Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 386 
Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln
110 115 120 125

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA 434 
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu
130 135 140

AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA CAA 482
Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln
145 150 155



GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC 530
Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys
160 165 170

AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC AAG 578
Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys
175 180 185

CCG AGG CGG TGATGACTGC AG 599
Pro Arg Arg
190



(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 456 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: cDNA



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13..456

(D) OTHER INFORMATION: /product= "VEGF121 Cys+2 with NcoI
sites"

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb

(B) LOCATION: 1..6

(D) OTHER INFORMATION: /note= "NcoI restriction site"

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb

(B) LOCATION: 98..103

(D) OTHER INFORMATION: /note= "NcoI restriction site" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:

GGATCCGAAA CC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT 48 

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu 
1 5 10 

GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA TGT 96 
Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Cys
15 20 25 

CCC ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC 144
Pro Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe
30 35 40 

ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG 192 
Met Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val
45 50 55 60 

GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA 240 
Asp Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro

65 70 75 

TCC TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC 288
Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly

80 85 90 

CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG 336
Leu Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met
95 100 105 

CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA 384 
Arg Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu
110 115 120 



CAG CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA 432
Gln His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln
125 130 135 140 

GAA AAA TGT GAC AAG CCG AGG CGG 456 
Glu Lys Cys Asp Lys Pro Arg Arg 

145 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 12..587

(D) OTHER INFORMATION: /product= "VEGF165 Cys+2 with NcoI
sites"



(ix) FEATURE: 

(A) NAME/KEY: misc_recomb 

(B) LOCATION: 1..6

(D) OTHER INFORMATION: /note= "NcoI restriction site"

(ix) FEATURE: 

(A) NAME/KEY: misc_recomb

(B) LOCATION: 97..102

(D) OTHER INFORMATION: /note= "NcoI restriction site"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GGATCCGAAA C ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC CTT GCC 50 

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala 
1 5 10 

TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA TGT CCC 98 
Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Cys Pro
15 20 25 

ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC ATG 146 
Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys Phe Met
30 35 40 45 

GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG GAC 194 
Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp

50 55 60 



ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA TCC 242
Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser
65 70 75

TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC CTG 290

Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu 
80 85 90 

GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG CGG 338 
Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg
95 100 105 

ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA CAG 386 
Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln
110 115 120 125 

CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA GAA 434 
His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu

130 135 140 

AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA CAA 482
Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln

145 150 155 

GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT TGC 530 
Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys
160 165 170 

AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC AAG 578 
Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys
175 180 185 

CCG AGG CGG TGATGACTGC AG 599 

Pro Arg Arg

190 

(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 

Pro Lys Lys Arg Lys Val Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 91: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 1..8

(D) OTHER INFORMATION: /product= nuclear translocation sequence
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 

Pro Pro Lys Lys Ala Arg Glu Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /product= nuclear translocation sequence
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

Pro Ala Ala Lys Arg Val Lys Leu Asp 
1 5

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..5

(D) OTHER INFORMATION: /product= nuclear translocation sequence
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93
Lys Arg Pro Arg Pro
1 5

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 

Lys Ile Pro Ile Lys
1 5 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /product= nuclear translocation sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 

Gly Lys Arg Lys Arg Lys Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 1..9

(D) OTHER INFORMATION: /product= nuclear translocation sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 

Ser Lys Arg Val Ala Lys Arg Lys Leu
1 5 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..9 

(D) OTHER INFORMATION: /product= nuclear translocation sequence 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 

Ser His Trp Lys Gln Lys Arg Lys Phe
1 5 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..8

(D) OTHER INFORMATION: /product= nuclear translocation sequence 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 

Pro Leu Leu Lys Lys Ile Lys Gln
1 5 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown
(ii) MOLECULE TYPE: peptide

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7

(D) OTHER INFORMATION: /product= nuclear translocation sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99

Pro Gln Pro Lys Lys Lys Pro
1 5 

(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..15

(D) OTHER INFORMATION: /product= nuclear translocation sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100

Pro Gly Lys Arg Lys Lys Glu Met Thr Lys Gln Lys Glu Val Pro
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..12

(D) OTHER INFORMATION: /product= nuclear translocation sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101

Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Ala Pro
1 5 10



(2) INFORMATION FOR SEQ ID NO: 102: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..7

(D) OTHER INFORMATION: /product= nuclear translocation sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102

Asn Tyr Lys Lys Pro Lys Leu
1 5 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..7

(D) OTHER INFORMATION: /product= nuclear translocation sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103

His Phe Lys Asp Pro Lys Arg 
1 5 



Claims



We claim: 



1. A conjugate, comprising a targeted agent and a vascular endothelial 
cell growth factor (VEGF) polypeptide or a portion thereof, wherein the conjugate binds to a 
VEGF receptor resulting in internalization of the linked targeted agent. 

2. A conjugate comprising the following components: (VEGF)n, (L)q and
(targeted agent)m, wherein:

L is a linker; 

VEGF is a VEGF monomer or a portion thereof; 

at least one VEGF monomer is linked at any residue via (L)q to at least one
targeted agent;

m and n, which are selected independently, are at least 1;

q is 0 or more as long as the resulting conjugate binds to the targeted receptor,
is internalized and delivers the targeted agent; and

the conjugate binds to a receptor that interacts with and internalizes VEGF, 
whereby the targeted agent(s) is internalized in a cell bearing the receptor. 

3. The conjugate of claim 2, wherein m and n, which are selected 
independently, are 1-6. 

4. The conjugate of claim 2, wherein n is 1.

5. The conjugate of claim 4, wherein m is 1.

6. The conjugate of claim 2, wherein q is 1.

7. The conjugate of claim 2, wherein L is selected from the group 
consisting of protease substrates, linkers that increase the flexibility of the conjugate, linkers 
that increase the solubility of the conjugate, photocleavable linkers and acid cleavable linkers. 

8. A conjugate of any one of claims 1 or 2, wherein a VEGF polypeptide
is selected from the group consisting of VEGF121, VEGF120, VEGF188, VEGF189,
VEGF164, VEGF165, VEGF205, VEGF206 and a modified VEGF121, VEGF120,
VEGF188, VEGF189, VEGF164, VEGF165, VEGF205 or VEGF206 in which a cysteine
residue is added or replaces a non-essential amino acid residue within about 20 amino acids
of the N-terminus or C-terminus of the monomer.

9. The conjugate of any one of claims 1 or 2, wherein the targeted agent 
is a cytotoxic agent. 

10. The conjugate of any one of claims 1 or 2, wherein the targeted agent 
is a ribosome-inactivating protein. 

11. The conjugate of claim 10, wherein the targeted agent is a saporin.

12. The conjugate of any one of claims 1 or 2, wherein the targeted agent
is a nucleic acid. 

13. The conjugate of any one of claims 1 or 2, wherein the targeted agent 
is an antisense nucleic acid. 

14. The conjugate of claim 2, wherein the conjugate is a fusion protein
selected from the group consisting of FPVS1, FPSV1, FPSV2, FPSV3, FPSV4, FPSV5,
FPSV6, FPSV7, FPSV8, FPSVV1, FPSVV2, FPSVV3, FPSVV4, FPSVV5, FPSVV6,
FPSVV7 and FPSVV8.

15. The conjugate of claim 2 that has the formula:
targeted agent-(L)q-VEGF-(L)r-VEGF, wherein

q and r, which may be the same or different, are 0 or 1.

16. A conjugate that has the formula:
(targeted agent)m-(L)q-(VEGF)n, wherein 
L is a linker; 

VEGF is a VEGF monomer or a portion thereof; 

at least one VEGF monomer is linked at any residue via (L)q to at least one
targeted agent;

m and n, which are selected independently, are at least 1;

q is 0 or more as long as the resulting conjugate binds to the targeted receptor,
is internalized and delivers the targeted agent; and

the conjugate binds to a receptor that interacts with and internalizes VEGF,
whereby the targeted agent(s) is internalized in a cell bearing the receptor. 

17. The conjugate of any one of claims 1-16, for use as an active 
therapeutic substance. 

18. The conjugate of any one of claims 1-16, for use in the manufacture of
a medicament for treating a VEGF-mediated pathophysiological condition. 

19. The conjugate of claim 18, wherein the pathophysiological condition is
a dermatological disorder with underlying vascular proliferation, a solid tumor, or an
ophthalmic disorder of the hyperproliferating blood vessels of the retina, iris, conjunctiva or 
vitreous humor. 



20. The conjugate of any one of claims 1-16, for inhibiting proliferation of 
cells bearing VEGF receptors.

21. The conjugate of claim 12, for effecting gene therapy, wherein the 
conjugate includes a nuclear translocation sequence operatively linked to the targeted nucleic 
acid or VEGF. 

22. A DNA fragment comprising a sequence of nucleotides encoding the 
conjugate of any one of claims 1-6 and 8-16. 

23. A plasmid, comprising the DNA of claim 22. 

24. A plasmid of claim 23, wherein the plasmid is an expression vector for 
expression of the DNA encoding the conjugate in eukaryotic cells or is an expression vector 
for expression of the conjugate in prokaryotic cells.

25. The plasmid of claim 24, wherein the vector is pPL-λ.

26. A plasmid of claim 23, selected from the group consisting of PZ72B1,
PZ73B1, PZ74B1, PZ74F5, PZ75B1, PZ75F5, PZ76B1, PZ76F5, PZ77B1, PZ78F5,
PZ79B1, PZ79F5, PZ80B1, PZ81B1, PZ81F5, PZ82B1, PZ83B1, PZ84B1, PZ85B1,
PZ85F5, PZ86B1, PZ95B1, PZ96B1, PZ97B1, PZ98B1, PZ99B1, PZ100B1, PZ101B1,
PZ102B1, PZ103B1, PZ104B1, PZ105B1, PZ106I1, PZ107I1, PZ108I1, PZ109J1, PZ110J1,
PZ111J1, PZ112J1, PZ113J1 and PZ114J1.

27. A cell transfected or transformed with the expression vector of claim 24.

28. The cell of claim 27 that is a bacterial cell. 

29. A method of producing a conjugate of any one of claims 1-6 and 8-16,
comprising culturing the cells of claim 27 under conditions whereby DNA is transcribed and
translated to produce the conjugate.

30. A vascular endothelial cell growth factor monomer that is modified by
insertion of a cysteine residue within about twenty amino acids of the N-terminus or
C-terminus, wherein the inserted residue replaces a nonessential residue in the unmodified
VEGF monomer or is added to the VEGF monomer.

31. The modified monomer of claim 30, wherein the cysteine residue is 
inserted within about 20 residues of the N-terminus. 

32. The VEGF monomer of claim 46 that is VEGF CYS+4,
VEGF CYS+2, or VEGF CYS-1.

33. DNA encoding the VEGF monomer of claim 30. 

34. A pharmaceutical composition, comprising the conjugate of any one of 
claims 1-16, in combination with a physiologically acceptable excipient. 

35. A method of producing a VEGF fusion protein comprising: 

(a) culturing cells transformed with a plasmid comprising pPL-λ
containing a DNA fragment according to claim 22, under conditions whereby the DNA
fragment is transcribed and translated;

(b) lysing the cells to release inclusion bodies; 

(c) solubilizing the inclusion bodies in a denaturant; and 

(d) removing the denaturant, thereby refolding the fusion protein.



36. A method of producing VEGF, comprising:

(a) culturing cells transformed with a plasmid comprising pPL-λ
containing a DNA fragment encoding VEGF, under conditions whereby the DNA fragment
is transcribed and translated;

(b) lysing the cells to release inclusion bodies; 

(c) solubilizing the inclusion bodies in a denaturant; and 

(d) removing the denaturant, thereby refolding the fusion protein. 